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NOVEL SURFACE EXPOSED PROTEINS FROM CHLAMYDIA PNEUMONIAE 

The present invention relates to the identification of 
members of a gene family from the human respiratory pathogen 
Chlamydia pneumoniae, encoding surface exposed membrane 
proteins of a size of approximately 89-101 kDa and of 56-57 
kDa, preferably about 89.6-100.3 kDa and about 56.1 kDa. The 
invention relates to the novel DNA sequences, the deduced 
amino acid sequences of the corresponding proteins and the 
use of the DNA sequences and the proteins in diagnosis of 
infections caused by C. pneumoniae, in pathology, in 
epidemiology, and as vaccine components. 

GENERAL BACKGROUND 

C. pneumoniae is an obligate intracellular bacteria 
(Christiansen and Birkelund (1992); Grayston et al . (1986)). 

15 It has a cell wall structure as Gram negative bacteria with 
an outer membrane, a periplasmic space, and a cytoplasmic 
membrane. It is possible to purify the outer membrane from 
Gram negative bacteria with the detergent sarkosyl . This 
fraction is named the 'outer membrane complex (OMC) ' (Caldwell 

20 et al. (1981)). The COMC (Chlamydia outer membrane complex) 
of C. pneumoniae contains four groups of proteins : A high 
molecular weight protein 9 8 kDa as determined by SDS-PAGE, a 
double band of the cysteine rich outer membrane protein 2 
(Omp2) protein of 62/60 kDa, the major outer membrane protein 

25 (MOMP) of 38 kDa, and the low-molecular weight lipo-protein. 
Omp3 of 12 kDa. The Omp2/Omp3 and MOMP proteins are present 
in COMC from all Chlamydia species, and these genes have been 
cloned from both C. trachomatis , C. psittaci and C. 
pneumoniae. However, the gene encoding 9 8 kDa protein from C, 

3 0 pneumoniae COMC have not been characterized or cloned. 

The current state of C. pneumoniae serology and detection 

C. pneumoniae is an obligate intra-cellular bacteria 
belonging to the genus Chlamydia which can be divided into 
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four species: C. trachomatis, C. pneumoniae, C. psittaci and 
C.pecorum. Common for the four species is their obligate 
intra cellular growth, and that they have a biphasic life 
cycle, with an extracellular infectious particle (the 
5 elementary body, EB) , and an intercellular replicating form 
(the reticulate body, RB) . In addition the Chlamydia species 
are characterized by a common lipopolysaccharide (LPS) 
epitope that is highly immunogenic in human infection. C. 
trachomatis is causing the human ocular infection (trachoma) 

10 and genital infections. C. psittaci is a variable group of 
animal pathogens where the avian strains can occasionally 
infect humans and give rise to a severe pneumonia 
(ornithosis) . The first C. pneumoniae isolate was obtained 
from an eye infection, but it was classified as a non-typable 

15 Chlamydia. Under an epidemic outbreak of pneumonia in Finland 
it was realized that the patients had a positive reaction in 
the Chlamydia genus specific test, (the lygranum test), and 
the patients showed a titre increase to the untyped Chlamydia 
isolates. Similar isolates were obtained in an outbreak of 

20 upper respiratory tract infections in Seattle, and the 
Chlamydia isolates were classified as a new species, 
Chlamydia pneumoniae (Grayston et al . (1989)). In addition, 
C. pneumoniae is suggested to be involved in the development 
of atherosclerotic lesions and for initiating bronchial 

25 asthma (Kuo et al . (1995)). These two conditions are thought 
to be caused by either chronic infections, by a 
hypersensitivity reaction, or both. 

Diagnosis of Chlamydia pneumoniae infections 



30 



Diagnosis of acute respiratory tract infection with C. 
pneumoniae is difficult. Cultivation of C. pneumoniae from 
patient samples is insensitive, even when proper tissue 
culture cells are selected for the isolation. A C. pneumoniae 
specific polymerase chain reaction (PCR) has been developed 
by Campbell et al . (1992) . 
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Even though Chlamydia pneumoniae has in several studies been 
detected by this PCR it is debated whether this method is 
suitable for detection under all clinical situations. The 
reason for this is, that the cells carrying Chlamydia 
5 pneumoniae in acute respiratory infections have not been 

determined, and that a chronic carrier state is expected but 
it is unknown in which organs and cells they are present. 
Furthermore/ the PCR test is difficult to perform due to the 
low yield of these bacteria and due to the presence of 

10 inhibitory substances in the patient samples. Therefore, it 
will be of great value to develop sensitive and specific 
sero-diagnostics for detecting both acute and chronic 
infections. Sero-diagnosis of Chlamydia infections is 
currently based on either genus specific tests as the 

15 Lygranum test and ELISA, measuring the antibodies to LPS, or 
the more species specific tests where antibodies to purifier 
EBs are measured by microimmuno fluorescence (Micro- IF) (Wang 
et al. (1970)). However, the micro-IF method is read by 
microscopy, and in order to ensure correct readings the 

20 result must be compared to the results with C. trachomatis 
used as antigen due to the cross -reacting antibodies to the 
common LPS epitope. Thus, there exists in the art an urgent 
need for development of reliable methods for species specific 
diagnosis of Chlamydia pneumoniae, as has been expressed in 

25 Kuo et al. (1995); "..a rapid reliable laboratory test of 

infection for the clinical laboratory is a major need in th^ 
field". Furthermore, the possible involvement of C. 
pneumoniae in atherosclerosis and bronchial asthma clearly 
warrants the development of an effective vaccine. 

3 0 DETAILED DISCLOSURE OF THE INVENTION 

The present invention aims at providing means for efficient 
diagnosis of infections with Chlamydia pneumoniae as well as 
the development of effective vaccines against infection with 
this microorganism. The invention thus relates to species 
3 5 specific diagnostic tests for infection in a mammal, such as 
a human, with Chlamydia pneumoniae, said tests being based on 
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the detection of antibodies against surface exposed membrane 
proteins of a size of approximately 89-101 kDa and of 56-57 
kDa, preferably of about 89.6-100.3 kDa and about 56.1 kDa 
(the range in size of the deduced amino acid sequences was 
5 from 100.3 to 89.6 except for Ompl3 with the size of 56.1 
kDa) , or the detection of nucleic acid fragments encoding 
such proteins or variants or subsequences thereof. The 
invention farther relates to the amino acid sequences of 
proteins according to the invention, to variants and 

10 subsequences thereof, and to nucleic acid fragments encoding 
these proteins or variants or subsequences thereof. The 
present invention further relates to antibodies against 
proteins according to the invention. The invention also 
relates to the use of nucleic acid fragments and proteins 

15 according to the invention in diagnosis of Chlamydia, 
pneumoniae and vaccines against Chlamydia pneumoniae. 

Prior to the disclosure of the present invention only a very 
limited number of genes from C. pneumoniae had been 
sequenced. These were primarily the genes encoding known C. 
trachomatis homologues : MOMP, Omp2, Omp3 , Kdo-transf erase , 
the heat shock protein genes GroEl/Es and DnaK, a 
ribonuclease P homologue and a. gene encoding a 76 kDa protein 
of unknown function. The reason why so few genes have been 
cloned to date is the very low yield of C. pneumoniae which 
can be obtained after purification from the host cells. After 
such purification the DNA must be purified from the EBs , and 
at this step the C. pneumoniae DNA can easily be contaminated 
with host cell DNA. In addition to these inherent 
difficulties, it is exceedingly difficult to cultivate C. 
3 0 pneumoniae and use DNA technology to produce expression 

libraries with very low amounts (few M g) of DNA. It has been 
known since 1993 (Melgosa et al . , 1993) that a 98 kDa protein 
is present in OMC from C. pneumoniae. Even though the protein 
bands of 9 8 kDa was mentioned to be part of the OMC of C. 
3 5 pneumoniae by Melgosa, the gene sequences and thus the 

deduced amino acid sequences have not been determined. Only 
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bands originating from Chlamydia, pneumoniae proteins in 
general separated by SDS-PAGE are describe therein. 
However, the gene encoding this protein has not been 
determined before the present invention. Only a very weak or 

5 no reaction with patient sera can be observed to the 98 kDa 
protein (Campbell et al . 1990) and prior to the work of the 
present inventors it has not been recognized that the 89-101 
kDa proteins are surface exposed or that they in fact is 
immunogenic. In this report it is described that a number of 

0 human serum samples reacts with a C. pneumoniae protein that 
in SDS-PAGE migrate as 98 kDa. The protein was not further 
characterized and it is therefore not in conflict with the 
present application. 



Halme et al . (1997) described the presence of human T-cell 
epitopes in C. pneumoniae proteins of 92-98 kDa. The protein;:, 
were eluted from SDS-PAGE of total chlamydia proteins but the 
identity of the proteins were not determined. 



Use of antibodies to screen expression libraries is a well 
known method to clone fragments of genes encoding antigenic 
20 parts of proteins. However, since patient sera do not show a 
significant -reaction with the 98 kDa protein it has not been 
possible to use patient serum to clone the proteins. 

It was known that monoclonal antibodies generated by the 
25 inventors reacted with conformational epitopes on the surface 
of C. pneumoniae and that they also reacted with C. 
pneumoniae OMC by immuno-electron microscopy (Christiansen et 
al. 1994). Furthermore, the 98 kDa protein is the only 
unknown protein from the C. pneumoniae OMC (Melgosa et al . 
1993) . The present inventors chose to take an unconventional 
step in order to clone the gene encoding the hitherto unknown 
9 8 kDa protein: C. pneumoniae OMC was purified and the highly 
immunogenic conformational epitopes were destroyed by SDS- 
treatment of the antigen before immunization. Thereby an 
35 antibody (PAB 150) to less immunogenic linear epitopes was 
obtained. This provided the possibility to obtain an 
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antiserum which could detect the protein, and it was shown 
that, a gene family encoding the 89-101 kDa and 56 proteins 
according to the invention could be detected in colony 
blotting of recombinant E. coli. 

Mice infected with C. pneumoniae generate antibodies to the 
proteins identified by the inventors and named Omp4-15, but 
do not recognize the SDS treated heat denatured antigens 
normally used for SDS-PAGE and immunoblotting . However, a 
strong reaction was seen if the antigen was not heat 
denatured. It is therefore highly likely that if a similar 
reaction is seen in connection with human infections the 
antigens of the present invention will be of invaluable use 
in sero-diagnostic tests and may very likely be used as a 
vaccine for the prevention of infections. 



By generating antibodies against CONIC from C. pneumoniae a 
polyclonal antibody ( PAB 150) was obtained which reacted with 
all the proteins. This antibody was used to identify the 
genes encoding the 8 9.6-101.3 kDa and 56.1 kDa proteins in an 

2 0 expression library of C. pneumoniae DNA. A problem in 

connection with the present invention was that a family 
comprising a number of similar, genes were found in C. 
pneumoniae. Therefore, a large number of different clones 
were required to identify clusters of fragments. Only because 
25 the rabbit antibody generated by the use of SDS-denatured 

antigens contained antibodies to a high number of different 
epitopes positioned on different members of the protein 
family did the inventors succeed in cloning and sequencing ' 
four of the genes. One gene was fully sequenced, a second was 

3 0 sequenced except for the distal part and shorter fragments of 

two additional genes were obtained by this procedure. To 
obtain the DNA sequence of the additional genes and to search 
for more members of the gene family long range PCR with 
primers derived from the sequenced genes, and primers from 
3 5 the genes already published in the database were used. This 

approach gave rise to the detection of additional eight genes 
belonging to this family. The genes were situated in two gene 



SUBSTITUTE SHEET (RULE 26) 



WO 98/58953 




PCT/DK98/00266 



clusters: Ompl2 , 11 , 10 , 5 , 4 , 13 and 14 in one cluster and 
Omp6, 7,8,9 and 15 in the second. Full sequence was obtained 
from Omp4, 5", S, 7, 8, 9,10,11 and 13, and partial sequence of 
Ompl2,14. Ompl3 was a truncated gene of 1545 nucleotides. The 
5 rest of the full length genes were from 2526 (Omp7) to 2838 
(Ompl5) nucleotides. The deduced amino acid sequences 
revealed putative polypeptides of 89.6 to 100.3 kDa, except 
for Ompl3 of 56.1 kDa. Alignment of the deduced amino acid 
sequences showed a maximum identity of 49% (Omp5/Omp9) when 
10 all the sequences were compared. Except for 0mpl3 , the lowest 
homology was to Omp7 with no more than 34% identity to any of 
the other amino acid sequences. The scores for Ompl3 was from 
29-32% to all the other sequences. 

In the present context SEQ ID Nos. 1 and 2 correspond to 
15 Omp4, SEQ ID Nos 3 and 4 correspond to Omp5 , SEQ ID Nos 5 a-\:} 
6 correspond to Omp6, SEQ ID Nos 7 and 8 correspond to Omp7, 
SEQ ID Nos 9 and 10 correspond to 0mp8, SEQ ID Nos 11 and 12 
correspond to Omp9, SEQ ID Nos 13 and 14 corresponds to 
OmplO, SEQ ID Nos 15 and 16 corresponds to Ompll, SEQ ID Nos 
20 17 and 18 corresponds to Ompl2, SEQ ID Nos 19 and 20 

corresponds to Ompl3, SEQ ID Nos 21 and 22 corresponds to 
Ompl4, and .SEQ ID Nos 23 and 24 corresponds to OmplS . 

The estimated size of the Omp proteins of the of the present 
invention are listed in the following. Omp 4 has a size of 

25 98.9 kDa, Omp5 has an estimated size of 97.2 kDa, Omp6 has an 
estimated size of 100.3 kDa, Omp7 has an estimated size of 
8 9.7 kDa, Omp8 has an estimated size of 9 0.0 kDa, Omp9 has an 
estimated size of 96.7 kDa, OmplO has an estimated size of 
98.4 kDa, Ompll has an estimated size of 97.6 kDa, 0mpl3 has 

30 an estimated size of 56.1 kDa, Omp 12 and 14 being partial. 

Furthermore, SEQ ID No 25 is a subsequence of SEQ ID No 3 , 
SEQ ID No 26 is a subsequence of SEQ ID No 4 , SEQ ID No 27 is 
a subsequence of SEQ ID No 5 , SEQ ID No 2 8 is a subsequence 
of SEQ ID No 6, SEQ ID No 2 9 is a subsequence of SEQ ID No 7, 
35 and SEQ ID No 30 is a subsequence of SEQ ID No 8. 
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Part of the omp proteins were expressed as fusion proteins, 
and mice polyclonal monospecific antibodies against the 
proteins were produced. The antibodies reacted with the 
surface of C. pneumoniae in both immunofluorescence and 
immunoelectron microscopy. This shows for the first time that 
the 89-101 JcDa and 56-57 kDa protein family in c. pneumoniae 
comprises surface exposed outer membrane proteins. This 
important finding leads to the realization that members of 
the 89-101 JcDa and 56-57 kDa C. pneumoniae protein family are 
good candidates for the development of a sero diagnostic test 
for C. pneumoniae, as well as the development of a vaccine 
against infections with C. pneumoniae based on using these 
proteins. Furthermore, the proteins may be used as 
epidemiological markers, and polyclonal monospecific sera 
15 against the proteins can be used to detect C. pneumoniae in 
human tissue or detect C. pneumoniae isolates in tissue 
culture. Also, the genes encoding the 89-101 kDa and 56-57 
kDa such as the 89.6-100.3 kDa and 56.1 protein family may be 
used for the development of a species specific diagnostic 
test based on nucleic acid detection/amplification. 



The full length Omp4 was cloned into an expression vector 
system that allowed expression of the Omp4 polypeptide. This 
polypeptide was used as antigen for immunization of a rabbit. 
Since the protein was purified under denaturing condition the 
25 antibody did not react with the native surface of C. 
pneumoniae, but it reacted with a 98 kDa protein in 
immunoblotting where purified C. pneumoniae EB was used as 
antigen. Furthermore, the antibody reacted in paraffin 
embedded sections of lung tissue from experimentally infected 



30 mice . 



A broad aspect of the present invention relates to a species 
specific diagnostic test for infection of a mammal, such as a 
human, with Chlamydia pneumoniae, said test comprising 
detecting in a patient or preferable in a patient sample the 
3 5 presence of antibodies against proteins from the outer 

membrane of Chlamydia pneumoniae, said proteins being of a 
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molecular weight of 89-101 kDa or 56-57 kDa, or detecting the 
presence of nucleic acid fragments encoding said outer 
membrane proteins or fragments thereof. 

5 In the context of the present application, the term "patient 
sample" should be taken to mean an amount of serum from a 
patient, such as a human patient, or an amount of plasma from 
said patient, or an amount of mucosa from said patient, or an 
amount of tissue from said patient, or an amount of 

10 expectorate, forced sputum or a bronchial aspirate, an amount 
of urine from said patient, or an amount of cerebrospinal 
fluid from said patient, or an amount of atherosclerotic 
lesion from said patient, or an amount of mucosal swaps from 
said patient, or an amount of cells from a tissue culture 

15 originating from said patient, or an amount of material which 
in any way originates from said patient. The in vivo test in 
a human according to the present invention includes a skin 
test known in the art such as an intradermal test, e.g 
similar to a Mantaux test. In certain patients being very 

20 sensitive to the test, such as is often the case with 
children, he test could be non- invasive , such as a 
superficial test on the skin, e.g. by use of a plaster 

In the present context, the term 89-101 kDa protein means 
proteins normally present in the outer membrane of Chlamydia 

2 5 pneumoniae, which in SDS-PAGE can be observed as one or more 

bands with an apparent molecular weight substantially in th.j 
range of 89-101 kDa. From the deduced amino acid sequences 
the molecular size varies from 89.6 to 100.3 kDa. 

Within the scope of the present invention are species 

3 0 specific sero-diagnostic tests based on the usage of the 

genes belonging to the gene family disclosed in the present 
application . 

Preferred embodiments of the present invention relate to 
species specific diagnostic tests according to the invention, 
3 5 wherein the outer membrane proteins have sequences selected 
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from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ 
ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID 
NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID 
NO: 22, and SEQ ID NO: 24. 



10 
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When used in connection with proteins according to the 
present invention the term "variant" should be understood as 
a sequence of amino acids which shows a sequence similarity 
of less than 100% to one of the proteins of the invention. A 
variant sequence can be of the same size or it can be of a 
different size as the sequence it is compared to. A variant 
will typically show a sequence similarity of preferably at 
least 50%, preferably at least 60%, more preferably at least 
70%, such as at least 80%, e.g. at least 90%, 95% or 98%. 

The term "sequence similarity" in connection with sequences 
of proteins of the invention means the percentage of 
identical and conservatively changed amino acid residues 
(with respect to both position and type) in the proteins of 
the invention and an aligned protein of equal of different 
length. The term "sequence identity" in connection with 
sequences of proteins of the invention means the percentage 
of identical amino acid with respect to both position and 
type in the proteins of the invention and an aligned protein 
of equal of different length. 

Within the scope of the present invention are subsequences of 
25 one of the proteins of the invention, meaning a consecutive 
stretch of amino acid residues taken from SEQ ID NO: 2, SEQ' 
ID NO: 4, SEQ ID NO: 6, SEQ ID NO : 8, SEQ ID NO: 10, SEQ ID 
NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID 
NO: 20, SEQ ID NO: 22 , or SEQ ID NO : 24. A subsequence will 
typically comprise at least 100 amino acids, preferably at 
least 80 amino acids, more preferably at least 70 amino 
acids, such as 5 0 amino acids. It might even be as small as 
10-50 amino acids, such as 20-40 amino acids, e.g. about 30 
amino acids. A subsequence will typically show a sequence 
3 5 homology of at least 50%, preferably at least 6 0%, more 
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preferably at least 70%, such as at least 80%, e.g. at least 
90%, 95% or 98%. 

Diagnostic tests according to the invention include 
immunoassays selected from the group consisting of a direct 
5 or indirect EIA such as an ELISA, an immunoblot technique 
such as a Western blot, a radio immuno assay, and any other 
non-enzyme linked antibody binding assay or procedure such as 
a fluorescence, agglutination or precipitation reaction, and 
nephelometry . 

10 A preferred embodiment of the present invention relates to 

species specific diagnostic tests according to the invention, 
said test comprising an ELISA, wherein antibodies against the 
proteins of the invention or fragments thereof are detected 
in samples . 

15 A preferred embodiment of the invention, is an ELISA based on 
detection in samples of antibodies against proteins of the 
invention. The ELISA may use proteins of the invention, or 
variants thereof, i.e. the antigen, as coating agent. An 
ELISA will typically be developed according to standard 

20 methods well known in the art,. such as methods described in 
"Antibodies; a laboratory manual", Ed. David Lane Harlow, 
Cold Spring Habor laboratories (19 88) , which is hereby 
incorporated by reference. 

Recombinant proteins will be produced using DNA sequences 
25 obtained essentially using methods described in the examples 
below. Such DNA sequences, comprising the entire coding 
region of each gene in the gene family of the invention, will 
be cloned into an expression vector from which the deduced 
protein sequence can be purified. The purified proteins will 
30 be analyzed for reactivity. in ELISA using both monoclonal and 
polyclonal antibodies as well as sera from experimentally 
infected mice and human patient sera. 



SUBSTITUTE SHEET (RULE 26) 



WO 98/58953 




PCT/DK98/00266 



From the experimentally infected mice sera it is known that 
non -.linear epitopes are recognized predominantly. Thus, it is 
contemplated that different forms of purification schemes 
known in the art will be used to analyze for the presence of 
discontinuous epitopes, and to analyze whether the human 
immune response is also directed against such epitopes. 

Preferred embodiments of the present invention relate to 
species specific diagnostic tests according to the invention, 
wherein the nucleic acid fragments have sequences selected 
from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ 
ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID 
NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID 
NO: 21, and SEQ ID NO: 23. 



In connection with nucleic acid fragments according to the 
present invention the term "variant" should be understood as 
a sequence of nucleic acids which shows a sequence homology 
of less than 100%. A variant sequence can be of the same size 
or it can be of a different size as the sequence it is 
compared to. A variant will typically show a sequence 
homology of at least 50%, preferably at least 60%, more 
preferably at least 70%, such as at least 80%, e.g. at least 
90%, 95% or 98%. 

The term "sequence homology" in connection with nucleic acid 
fragments of the invention means the percentage of matching 
nucleic acids (with respect to both position and type) in the 
nucleic acid fragments of the invention and an aligned 
nucleic acid fragment of equal or different length. 

In order to obtain information concerning the general 
distribution of each of the genes according to the present 
invention, PCR will be performed for each gene on all 
available C. pneumoniae isolates. This will provide 
information on the general variability of the genes or 
nucleic acid fragments of the invention. Variable regions 
will be sequenced. From patient samples PCR will be used to 
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amplify variable parts of the genes for epidemiology. Non- 
variable parts will be used for amplification by PCR and 
analyzed for possible use as a diagnostic test. It is 
contemplated that if variability is discovered, PCR of 
5 variable regions can be used for epidemiology. PCR of non- 
variable regions can be used as a species specific diagnostic 
test. Using genes encoding proteins known to be invariable in 
all known isolates prepared as targets for PCR to genes 
encoding proteins with unknown function. 

10 Particularly preferred embodiments of the present invention, 
relate to diagnostic tests according to the invention, 
wherein detection of nucleic acid fragments is obtained by 
using nucleic acid amplification, preferably polymerase chain 
reaction (PCR) . 

15 Within the scope of the present invention is a PCR based test 
directed at detecting nucleic acid fragments of the invention 
or variants thereof. A PCR test will typically be developed 
according to methods well known in the art and will typically 
comprise a PCR test capable of detecting and differentiating 

20 between nucleic acid fragments of the invention. Preferred 
are quantitative competitive PCR tests or nested PCR tests. 
The PCR test according to the invention will typically be 
developed according to methods described in detail in EP B 
540 588, EP A 586 112, EP A 643 140 OR EP A 669 401, which 

2 5 are hereby incorporated by reference. 

Within the scope of the present invention are variants and 
subsequences of one of the nucleic acid fragments of the 
invention, meaning a consecutive stretch of nucleic acids 
taken from SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 

3 0 NO: 7, SEQ ID NO : 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 

15, SEQ ID NO: 19, SEQ ID NO: 21, or SEQ ID NO : 23. A variant 
or subsequence will preferably comprise at least 100 nucleic 
acids, preferably at least 80 nucleic acids, more preferably 
at least 70 nucleic acids, such as at least 50 nucleic acids. 
35 It might even be as small as 10-50 nucleic acids, such as 
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20-40 nucleic acids, e.g. about 30 nucleic acids. A 
subsequence will typically show a sequence homology of at 
least 30%, preferably at least 60%, more preferably at least 
70%, such as at least 80%, e.g. at least 90%, 95% or 98%. The 
shorter the subsequence, the higher the required homology. 
Accordingly, a subsequence of 100 nucleic acids or lower must 
show a homology of at least 80%. 

A very important aspect of the present invention relates to 
proteins of the invention derived from Chlamydia pneumoniae 
having amino acid sequences selected from the group 
consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO : 6, SEQ 
ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID 
NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, and SEQ 
ID NO: 24 having a sequence similarity of at least 50%, 
preferably at least 60%, more preferably at least 70%, such 
as at least 80%, e.g. at least 90%, 95% or 98% and a similar 
biological function . 

By the term "similar biological function" is meant that the 
protein shows characteristics similar with the proteins 
derivable from the membrane proteins of Chlamydia pneumoniae. 
Such proteins comprise repeated motifs of GGAI (at least 2, 
preferable at least 3 repeats) and/or conserved positions of 
tryptophan, (w) . 

Comparison of the DNA sequences from genes encoding Omp4-15 
shows that the overall similarity between the individual 
genes ranges between 43-55%. Comparison of the amino acid" 
sequences of Omp4-15 shows 34-49% identity and 53-64% 
similarity. The homology is generally scattered along the 
entire length of the deduced amino acids. However, as seen 
from figure 8 A - J there are some regions in which the 
homology is more pronounced. This is seen in the repeated 
sequence where the sequence GGAI is repeated 4-7 times in the 
genes. It is interesting that the DNA homology is not 
conserved for the sequences encoding the four amino acids 
GGAI . This may indicate a functional role of this part of the 
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protein and indicates that the repeated structure did not 
occur by a duplication of the gene. In addition to the four 
amino acid repeats GGAI a region from amino acid 400 to 490 
has a higher degree of homology than the rest of the protein, 
5 with the conserved sequence FYDPI occurring in all sequences. 
As further indication of similarity in function the amino 
acid tryptophan (W) is perfectly conserved at 4-6 
localizations in the C- terminal part of the protein. 

Since none of the genes and deduced amino acid sequences of 
10 the invention are identical the following is within the scope 
of the present invention; production of monospecific 
antibodies, the use of said antibodies for characterizing 
which C. pneumoniae proteins are expressed, the use of said 
antibodies for characterizing at which time during 
15 developmental life cycle said C. pneumoniae proteins are 

expressed, and the use of said antibodies for characterizing 
the precise cellular localization of said C. pneumoniae 
proteins. Also within the scope of the present invention is 
the use of monospecific antibodies against proteins of the 
20 invention for determining which part of said proteins is 

surface exposed and how proteins in the C. pneumoniae COMC 
interact with each other. 

Preferred embodiments of the present invention relate to 
25 polypeptides which comprise subsequences of the proteins of 
the invention, said subsequences comprising the sequence 
GGAI . Further preferred embodiments of the present invention 
relate to polypeptides which comprise subsequences of the 
proteins of the invention, said subsequences comprising the 
3 0 sequence FSGE . 

Polypeptides according to the invention will typically be of 
a length of at least 6 amino acids, preferably at least 15 
amino acids, preferably at least 20 amino acids, preferably 
at least 25 amino acids, preferably at least 3 0 amino acids, 
3 5 preferably at least 3 5 amino acids, preferably at least 40 
amino acids, preferably at least 45 amino acids, preferably 
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at least 50 amino acids, preferably at least 55 amino acids, 
preferably at least 100 amino acids. 

A very important aspect of the present invention relates to 
nucleic acid fragments of the invention derived from 
Chlamydia, pneumoniae, variants and subsequences thereof. 

Another important aspect of the present invention relates to 
antibodies against the proteins according to the invention, 
such antibodies including polyclonal monospecific antibodies 
and monoclonal antibodies against proteins with sequences 
selected from the group consisting of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 
12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 
20, SEQ ID NO: 22, and SEQ ID NO: 24. 

A very important aspect of the present invention relates to 
diagnostic kits for the diagnosis of infection of a mammal, 
such as a human, with Chlamydia pneumoniae, said kits 
comprising one or more proteins with amino acid sequences 
selected from the group consisting of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 
12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 
20, SEQ ID NO: 22, and SEQ ID NO: 24. 

Another very important aspect of the present invention 
relates to diagnostic kits for the diagnosis of infection of 
a mammal, such as a human, with Chlamydia pneumoniae, said 
kits comprising antibodies against a protein with an amino ' 
acid sequence selected from the group consisting of SEQ ID 
NO: 2, SEQ ID NO : 4, SEQ ID NO : 6, SEQ ID NO : 8, SEQ ID NO: 
10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 
18, SEQ ID NO: 20, SEQ ID NO: 22, and SEQ ID NO: 24. 
Antibodies included in a diagnostic kit according to the 
invention can be polyclonal or monoclonal or a mixture 
hereof . 
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Still another very important aspect of the present invention 
relates to diagnostic kits for the diagnosis of infection of 
a mammal, such as a human, with Chlamydia pneumoniae, said 
kits comprising one or more nucleic acid fragments with 
sequences selected from the group consisting of SEQ ID NO: 1, 
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO : 9, SEQ 
ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 
ID NO: 19, SEQ ID NO: 21, and SEQ ID NO: 23. 

An aspect of the present invention relates to a composition 
for immunizing a mammal, such as a human, against Chlamydia 
pneumoniae, said composition comprising one or more proteins 
with amino acid sequences selected from the group consisting 
Of SEQ ID NO: 2, SEQ ID NO : 4, SEQ ID NO: 6, SEQ ID NO : 8, 
SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, 
SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, and SEQ ID NO: 
24 . 

An important role for the proteins of the invention in 
prevention of infection of a mammal, such as a human, with C. 
pneumoniae is expected. Thus proteins of the invention, 
including variants and subsequences will be produced, 
typically by using recombinant -techniques, and will then be 
used as an antigen in immunization of mammals, such as 
rabbits. Subsequently, the hyper immune sera obtained by the 
immunization will be analyzed for protection against C. 
pneumoniae infection using a tissue culture assay. In 
addition it is contemplated that monoclonal antibodies will 
be produced, typically using standard hybridoma techniques, 
and analyzed for protection against infection with C. 
pneumoniae . 

It is envisioned that particularly interesting and 
immunogenic epitopes will be found in connection with the 
proteins of the invention, which will comprise subsequences 
of said proteins. It is preferred to use polypeptides 
comprising such subsequences of the proteins of the invention 
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in immunizing a mammal, such as a human, against Chlamydia 
pneumoniae . 

An important aspect of the present invention relates to the 
use of proteins with sequences selected from the group 
5 consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ 

ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID 
NO: 16, SEQ- ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, and SEQ 
ID NO: 24 in diagnosis of infection of a mammal, such as a 
human, with Chlamydia pneumoniae. 

10 A preferred embodiment of the present invention relates to 
the use of proteins according to the invention in an 
undenatured form, in diagnosis of infection of a mammal, such 
as a human, with Chlamydia pneumoniae. 

A very important aspect of the present invention relates to 
15 the use of proteins with sequences selected from the group 
consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO : 6, SEQ 
ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID 
NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, and SEQ 
ID NO: 24, for immunizing a mammal, such as a human, against 
20 Chlamydia pneumoniae. 

A preferred embodiment of the present invention relates to 
the use of proteins according to the invention in an 
undenatured form, for immunizing a mammal, such as a human, 
against Chlamydia pneumoniae. 



25 
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A very important aspect of the present invention relates to 
the use of nucleic acid fragments with nucleotide sequences 
selected from the group consisting of SEQ ID NO: 1, SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 
11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 
19, SEQ ID NO: 21, and SEQ ID NO: 23 for immunizing a mammal 
such as a human, against Chlamydia pneumoniae. 
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It is envisioned that one type of vaccine against C. 
pneumoniae will be developed by using gene-gun vaccination of 
mice. Typically, different genetic constructs containing 
nucleic acid fragments, combinations of nucleic acid 
fragments according to the invention will be used in the 
gene -gun approach. The mice will then subsequently be 
analyzed for production of both humoral and cellular immune 
response and for protection against infection with C. 
pneumoniae after challenge herewith. 

In line with this, the invention also relates to the uses of 
the proteins of the invention as a pharmaceutical (a vaccine) 
as well as to the uses thereof for the preparation of a 
vaccine against infections with Chlamydia, pneumoniae. 

Preparation of vaccines which contain protein sequences as 
active ingredients is generally well understood in the art, 
as exemplified by U.S. Patents 4,608,251; 4,601,903; 
4,599,231; 4,599,230; 4,596,792; and 4,578,770, all incorpor- 
ated herein by reference. Typically, such vaccines are pre- 
pared as injectables either as liquid solutions or suspen- 
sions; solid forms suitable for solution in, or suspension 
in, liquid prior to injection may also be prepared. The 
preparation may also be emulsified. The active immunogenic 
ingredient is often mixed with excipients which are pharma- 
ceutieally acceptable and compatible with the active ingredi 
ent. Suitable excipients are, for example, water, saline, 
dextrose, glycerol, ethanol, or the like, and combinations 
thereof. In addition, if desired, the vaccine may contain 
minor amounts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, or adjuvants which 
enhance the effectiveness of the vaccines. 



The vaccines are conventionally administered parenterally , by 
injection, for example, either subcutaneously or intramuscu- 
larly. Additional formulations which are suitable for other 
modes of administration include suppositories and, in some 
3 5 cases, oral f ormulat ions . These compositions take the form of 
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solutions, suspensions, tablets, pills, capsules, sustained 
release formulations or powders and contain 10-95% of active 
ingredient, preferably 25-70%, and optionally a suitable 
carrier . 



The protein sequences may be formulated into the vaccine as 
neutral or salt forms known in the art . The vaccines are 
administered in a manner compatible with the dosage 
formulation, and in such amount as will be therapeutically 
effective and immunogenic. The quantity to be administered 
depends on the subject to be treated. Suitable dosage ranges 
are of the order of several hundred micrograms active 
ingredient per vaccination with a preferred range from about 
0.1 /ig to 1000 pig. The immune response may be enhanced if the 
vaccine further comprises an adjuvant substance as known in 
the art. Other possibilities involve the use of 
immunomodulating substances such as lymphokines (e.g. I FN - y , 
IL-2 and 11,-12) or synthetic IFN-7 inducers such as poly I;C 
in combination with the above-mentioned adjuvants. 

It is also possible to produce a living vaccine by introdu- 
cing, into a non-pathogenic microorganism, at least one 
nucleic acid fragment encoding a protein fragment or protein 
of the invention, and effecting expression of the protein 
fragment or the protein on the surface of the microorganism 
(e.g. in the form of a fusion protein including a membrane 
anchoring part or in the form of a slightly modified protein 
or protein fragment carrying a lipidation signal which allows 
anchoring in the membrane) . The skilled person will know how 
to adapt relevant expression systems for this purpose. 

Another part of the invention is based on the fact that 
recent research have revealed that a DNA fragment cloned in a 
vector. which is non-replicative in eukaryotic cells may be 
introduced into an animal (including a human being) by e.g. 
intramuscular injection or percutaneous administration (the 
so-called "gene gun" approach) . The DNA is taken up by e.g. 
muscle cells and the gene of interest is expressed by a 
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promoter which is functioning in eukaryotes, e.g. a viral 
promoter, and the gene product thereafter stimulates the 
immune system. These newly discovered methods are reviewed in 
Ulmer et al . , 1993, which hereby is included by reference. 

Thus, a nucleic acid fragment encoding a protein or protein 
of the invention may be used for effecting in vivo expression 
of antigens, i.e. the nucleic acid fragments may be used in 
so-called DNA vaccines. Hence, the invention also relates to 
a vaccine comprising a nucleic acid fragment encoding a 
protein fragment or a protein of the invention, the vaccine 
effecting in vivo expression of antigen by an mammal, such as 
a human, to whom the vaccine has been administered, the 
amount of expressed antigen being effective to confer 
substantially increased resistance to infections with 
Chlamydia, pneumoniae in an mammal, such as a human. 

The efficacy of such a "DNA vaccine" can possibly be enhanced 
by administering the gene encoding the expression product 
together with a DMA fragment encoding a protein which has the 
capability of modulating an immune response. For instance, a 
gene encoding lymphokine precursors or lymphokines (e.g. IFN- 
y, IL.-2, or IL-12) could be administered together with the 
gene encoding the immunogenic protein fragment or protein, 
either by administering two separate DNA fragments or by 
administering both DNA fragments included in the same vectc 
It is also a possibility to administer DNA fragments compri- 
sing a multitude of nucleotide sequences which each encode 
relevant epitopes of the protein fragments and proteins 
disclosed herein so as to effect a continuous sensitization 
of the immune system with a broad spectrum of these epitopes. 

The following experimental non- limiting examples are intenc-: 
to illustrate certain features and embodiments of the inven- 
tion . 
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LEGENDS TO FIGURES 

Figure 1 . The figure shows electron microscopy of negative 
stained purified C. pneumoniae EB (A) and purified OMC (B) 

Figure 2. The figure shows silver stained 15% SDS-PAGE of 
5 purified EB and OMC . Lane 1, purified C. pneumoniae EB; lane 
2, C. pneumoniae OMC; lane 3, purified C. trachomatis EB; and 
lane 4 C. trachomatis OMC. 

Figure 3 . The figure shows immunoblotting of C. pneumoniae EB 
separated by 10% SDS-PAGE , transferred to nitrocellulose and 
reacted with rabbit anti C. pneumoniae OMC. 



10 



15 



20 



Figure 4. The figure shows coomassie blue stained 7.5% 
SDS-PAGE of recombinant pEX that were detected by the rabbit 
anti C. pneumoniae serum. Arrow indicated the localization of 
the 117 kDa b-galactosidase protein. 

Figure 5. The figure shows immunoblotting of recombinant pEX 
colones detected by colony blotting separated by 7.5% 
SDS-PAGE and transferred to nitrocellulose and reacted with 
rabbit anti C. pneumoniae OMC. Lane 1, seablue molecular 
weight standard. Lane 2-6 pEX clones cultivated at 42°C to 
induce the production of the b-galactosidase fusion proteins. 

Figure 6. The figure shows sequence strategy for 0mp4 and 
Omp5 . Arrows indicates primers used for sequencing. 

Figure 7. C pneumoniae omp genes. The genes are arranged in 
two clusters. In cluster 1 Ompl2 / 11, 10, 5, 4, 13, and 14 
2 5 are found. In cluster 2 are found Omp6 , 7, 8, 9, and 15. 

Figure 8 A - J. The figure shows alignment of C. pneumoniae 
Omp4-15, using the program pileup in the GCG package. 

Figure 9. The figure shows immunofluorescence of C. 
pneumoniae infected HeLa, 72 hrs . after infection, reacted 
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with mouse monospecific anti-serum against pEX3-36 fusion 
protein. pEX3-3 6 is a part of the Omp5 gene. 

Figure 10. The figure shows immunoblotting of C. pneumoniae 
EB, lane 1-3 heated to 100°C in SDS-sample buffer, lane 4-6 
5 unheated. Lane 1 reacted with rabbit anti C. pneumoniae OMC; 
lane 2 and 4 pre -serum; lane 3 and 5 polyclonal rabbit anti 
pEXl-1 fusion protein; lane 6 MAb 26.1. 

Figure 11. The figure shows immunoblotting of C. pneumoniae 
EB, lane 1-4 heated to lOOoC in SDS-sample buffer, lane 5-6 
10 unheated. Reacted with serum from C57 -black mice 14 days 

after infection with 10 7 CFU of c. pneumoniae. Lane 1 and 5 
mouse 1; lane 2 and 6 mouse 2; lane 3 and 5 mouse 3; and lane 
4 and 8 mouse 4 . 

Figure 12 . The figure shows immunohistochemistry analysis of 
15 mouse lung tissue with C. pneumoniae inclusions present both 
in the bronchial epithelium and in the lung parenchyma 
(arrows) . 



SUBSTITUTE SHEET (RULE 26) 



WO 98/58953 
EXAMPLE 1 



24 



PCT/DK98/00266 



Cloning of the genes encoding the 98/95 kDa C. pneumoniae 
COMC proteins 

Purification of C. pneumonia EEs and COMC 

C. pneumoniae was cultivated in HeLa cells. Cultivation was 
done according to the specifications of Miyashita and 
Matsumoto (1992), with the modification that centrif ugation 
of supernatant and of the later precipitate and turbid bottom 
layer was carried out at 100,000 X g. The microorganism 
attached to the HeLa cells by 3 0 minutes of centrif ugation at 
1000 x g, after which the cells were incubated in RPMI 1640 
medium (Gibco BRL , Germany cat No. 51800-27), containing 5% 
foetal calf serum (FCS, Gibco BRL, Germany Cat No. 10106.169) 
gentamicin for two hours at 37°C in 5% C02 atmosphere. The 
medium was changed to medium that in addition contained 1 mg 
per ml of cycloheximide . After 48 hours of incubation a 
coverslip was removed from the cultures and the inclusion was 
tested with an antibody specific for c. pneumoniae (MAb 26.1) 
(Christiansen et al . 1994) and a monoclonal antibody specific 
for the species C. trachomatis (MAb 32.3, Loke diagnostics, 
Arhus Denmark) to ensure that no contamination with C. 
trachomatis had occurred. The HeLa cells were tested by 
Hoechst stain for Mycoplasma contamination as well as by 
culture in BEa and BEg medium (Freund et al . , 1979). Also the 
C. pneumoniae stocks were also tested for Mycoplasma 
contamination by cultivation in BEa and BEg medium. No 
contamination with C. trachomatis, Mycoplasmas or bacteria 
were detected in cultures or cells. 72 hours post-infection 
the monolayer was washed in PBS , the cells were loosened in 
PBS with a rubber policeman, and the Chlamydia were liberated 
from the host cell by sonication. The C. pneumoniae EEs and 
RBs were purified on discontinuous density gradients 
(Miyashita et al . (1992)) . The purity of the Chlamydia EBs 
were verified by negative staining and electronmicroscopy 
(Figure 1), only particles of a size of 0 . 3 to 0 . 5 mm were 
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detected in agreement with the structure of C. pneumonia EBs 
The purified Chlamydia EBs were subjected to sarkosyl 
extraction as described by Caldwell et al (1981) with the 
modification that a brief sonication was used to suspend the 
COMC. The purified COMC was tested by electronmicroscopy and 
negative staining (Figure 1) , where a folded outer membrane 
complex was seen. 

SDS -PAGE analysis of purified EBs and COMC 

The proteins from purified EBs and C. pneumoniae OMC were 
separated on 15% SDS -polyacrylamide gel, and the gel was 
silver stained (Figure 2) , in lane 1 it is seen that the 
purified EBs contain major proteins of 100/95 kDa and a 
protein of 38 kDa, in the purified COMC (lane 2) these two 
protein groups are also dominant. In addition, proteins with 
a molecular weight of 62/60 kDa, 5 5 kDa, and 12 kDa have bee 
enriched in the COMC preparation. When the purified C. 
pneumoniae EBs are compared to purified C. trachomatis EB 
(lane 3) it is seen that predominant protein in the C. 
trachomatis EB is the major outer membrane protein (MOMP) , 
and it is also the dominant band in the COMC preparation of 
C. trachomatis (lane 4) , and Omp2 of 60/62 kDa as well as 
Omp3 at 12 kDa are seen in the preparation. However, no ma jo 
bands with a size of 100/95 kDa are detected as in the C. 
pneumoniae COMC preparation. 

Production of rabbit polyclonal antibodies against C. 
pneumoniae COMC 

To ensure production of rabbit antibodies that would 
recognize all the C. pneumoniae proteins in immuno-blottinc 
and colony-blotting 10 ^g of COMC antigen was dissolved in 2 
Ml of SDS sample buffer and thereafter divided into 5 vials. 
The dissolved antigen was further diluted in one ml of PBS 
and one ml of Freund incomplete adjuvant (Difco laboratories 
USA cat. No. 0639-60-6) and injected into the quadriceps 
muscle of a New Zealand white rabbit. The rabbit was giver; 
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three times intramuscular injections at an interval of one 
week, and after further three weeks the dissolved COMC 
protein, diluted in one ml PBS was injected intravenously, 
and the procedure was repeated two weeks later. Eleven weeks 
5 after the beginning of the immunization, the serum was 

obtained from the rabbit. Purified C. pneumoniae EBs were 
separated by SDS-PAGE, and the proteins were 

electrotransf erred to nitrocellulose membrane. The membrane 
was blocked and immunostained with the polyclonal COMC 
10 antibody (Figure 3) . The serum recognized proteins with a 

size of 100/95, 60 and 38 kDa in the EB preparation. This is 
in agreement with the sizes of the outer membrane proteins. 

Cloning of the COMC proteins 

Due to the cultivation of C. pneumoniae in HeLa cells, 
15 contaminating host cell DNA could be present in the EB 

preparations. Therefore, the purified EB preparations were 
treated with DNAse to remove contaminating DNA. The C. 
pneumoniae DNA was then purified by CsCl gradient 
centrifugation. The C. pneumoniae DNA was partially digested 
20 with Sau3A and the fractions containing DNA fragments with a 
size of approx. 0.5 to 4.0 kb were cloned into the expression 
vector system pEX (Boehringer, Germany cat. No. 1034 766, 
1034 774, 1034 782) . The pEX vector system has a 
/3-galactosidase gene with multiple cloning sites in the 3 ' end 
25 of the /3-galactosidase gene. Expression of the gene is 

regulated by the PR promoter, so the protein expression can 
be induced by elevating the temperature from 32 to 42°C. The 
colonies of recombinant bacteria were transferred to 
nitrocellulose membranes, and the temperature was increased 
to 4 2°C for two hours. The bacteria were lysed by placing the 
nitrocellulose membranes on filters soaked in 5% SDS . The 
colonies expressing outer membrane proteins were detected 
with the polyclonal antibody raised against C. pneumoniae 
COMC. The positive clones were cultivated in suspension and 
induced at 42°C for two hours. The protein profile of the 
clones were analysed by SDS - PAGE , and increases in the size 



30 



35 



SUBSTITUTE SHEET (RULE 26) 



WO 98/58953 




PCT/DK98/00266 



of the induced b-galactosidase were observed (Figure 4) In 
addition, the proteins were electrotransf erred to 
nitrocellulose membranes, and the reaction with the 
polyclonal serum against COMC was confirmed (Figure 5) 

Sequencing of positive COMC clones 

To characterize the pEX clones, the inserted C. pneumoniae 
DNA was sequenced. The resulting DNA sequences were searched 
against the prokaryotic sequences in the GenEmbl database. 
The search identified 6 clones as part of the Omp2 gene, and 
2 clones as part of the Omp3 gene, and 2 clones as part of 
the MOMP gene, indicating that COMC proteins had been 
successfully cloned. Furthermore, 32 clones were obtained, 
containing DNA sequences not found in the GenEmbl database. 
These sequences could, however, be clustered in two contics 
of 6 and 4 clones , and three clones were identical . In 
addition 19 clones were found with no overlap to the contics 
(Figure 7) . To obtain more sequence data for the genes, C. 
pneumoniae DNA was totally digested with BamHI restriction 
enzyme, and the fragments were cloned into the vector 
pBluescript. The ligated DNA was electrotransf ormed into E . 
coli XLl-Blue and selected on plates containing Ampicillin. 
The recombinant bacterial colonies were transferred to a 
nitrocellulose membrane, and colony hybridisation was 
performed using the inserts of pEX l-l clone as a probe. A 
clone containing a single BamHI fragment of 4.5 kb was found, 
and the hybridisation to the probe was confirmed by Southern 
blotting. The insert of the clone was sequenced 

bi-directionally using synthetic primers for approx. each 300 
bp. The sequence of the BamHI fragment made it possible to 
join the two contics of pEX clones. Totally, together with 
the pEX clones it was possible to assemble 6.5 kb DNA 
sequence, encoding two new COMC proteins. (Figure 6) 

Additional sequences were obtained by PCR performed on 
purified C. pneumoniae DNA with primers both from the known 
Omp genes and from other known genes. The obtained PCR 
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products were sequenced, The sequence organisation is shown 
in Fig. 7. Additional 8 Omp genes were detected. The 
alignment of the deduced amino acid sequences are shown in 
Fig . 8 A and B . 

5 Analysis of DNA sequence 

The DNA sequence encoding the Omp4-15 proteins with a size of 
89.6-100.3 kDa (and for Ompl3 : 56.1 kDa) . Omp4 and Omp5 were 
transcribed in opposite directions. Downstream 0mp4 a 
possible termination structure was located. The 3 ' end of the 
Om P 5 gene was not cloned due to the presence of the BamHI 
restriction enzyme site positioned within the gene. The 
translated DNA sequence of Omp4 and OtnpS was compared by use 
of the gap programme in the GCG package (Wisconsin package, 
version 8.1-UNIX, August 1995, sequence analysis software 
package) . The two genes had an amino acid identity of 41% 
(similarity 61%) , and a possible cleavage site for signal 
peptidase 1 was present at amino acid 17 in Omp4 and amino 
acid 2 5 in Omp5 . When the amino acid sequence encoded by two 
other pEX clones were compared to the sequence of 0mp4 and 
Omp5 they also had amino acid homology to the genes. It is 
seen that the two clones have homology to the same area in 
the Omp4 and Omp5 proteins. Consequently, the pEX clones must 
have originated from two additional genes. Therefore these 
genes were named Omp6 and Omp7 . Similar analyses were 
performed with the other genes. In contrast to what was seen 
for Omp4 and 5 none of the other putative omp proteins had a 
cleavage site for signal peptides. 

EXAMPLE 2 

Polyclonal monospecific antibodies against pEX fusion 
3 0 proteins and full length recombination + Omp4 

To investigate the topology of the Omp4-7 proteins, 
representative pEX clones, were selected from each gene. The 
fusion proteins of 0 -galactosidase/omp were induced, and the 
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proteins were partially purified as inclusion bodies. Balb/c 
mice were immunized three times intramuscular with the 
antigens at an interval of one week, and after six weeks the 
serum was obtained from the mice. HeLa cells were infected 
5 with the C. pneumoniae. 72 hours after the infection the 

mono-layers were fixed with 3.7% formaldehyde. This treatment 
makes the outer membrane of the Chlamydia impermeable for 
antibodies due to the extensive cross -linking of the outer 
membrane proteins by the formaldehyde. The HeLa cells were 
10 permeabilized with 0.2% Triton X100, the monolayers were 
washed in PBS, then incubated with 20% (v/v) FCS to 
inactivate free radicals of the formaldehyde. The mice sera 
were diluted 1:100 PBS with 20% (v/v) FCS and incubated with 
the monolayers for half an hour. The monolayers were washed 
15 in PBS and secondary FITCH conjugated rabbit anti mouse serum 
was added for half an hour, and the monolayers were washed 
and mounted. Several of the antibodies reacted strongly with 
the EBs in the inclusions (Figure 9) . in spite of the 
formaldehyde fixation it could not be excluded that the 
surface of the EB was changed by the treatments, so that the 
antibodies could get access to the Omp4-7. Therefore, the 
reaction was confirmed by immuno- electron microscopy with the 
antibody raised against clone pEX3-36. Purified EB of C. 
pneumoniae were absorbed to carbon coated nickel grids. After 
the absorption the grids were washed with PBS and blocked in 
0.5% Ovalbumin dissolved in PBS. The antibodies were diluted 
1:100 in the same buffer and incubated for 3 0 minutes. The 
grids were washed in PBS. Rabbit anti mouse Ig conjugated 
with 10nm colloidal gold diluted in PBS containing 1% gelatin 
was added to the grids for half an hour. The grids were 
washed in 3 x PBS with 1% gelatin and 3 times in PBS, the 
grids were contrastained with 0.7% phospho tungstic acid. TK- 
grids were analysed in a Jeol 1010 electron microscope at 4 0 
kV. It was seen that the gold particles were covering the 
3 5 surface of the purified EB . Because the C. pneumoniae EBs 
were not exposed to any detergent or fixation under either 
the purification or the reaction with antibodies, these 
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results show that the cloned proteins have surface exposed 
epitopes . 

Polyclonal monospecific antibodies against Omp4 

The Omp4 gene was amplified by PCR with primers that 
5 contained LIC-sites, and the PCR product was cloned into the 
pET-3 0 LIC vector (Novagen) . The histidine tagged fusion 
protein was expressed by induction of the synthesis by IPTG 
and purified over a nickel column. The purified 0mp4 protein 
was used for immunization of a rabbit (six times, 8 ng each 
10 time) . 



Use of rabbit polyclonal antibodies to recombinant Omp4 for 
detection of Chlamydia pneumoniae in paraffin embedded 
sections 

The lungs of C. pneumoniae infected mice were obtained three 
15 days after intranasal infection. The tissue samples were 

fixed in 4% formaldehyde, paraffin embedded, sectioned and 
deparaf finized prior to staining. The sections were incubated 
with the rabbit serum diluted 1:200 in TBS ( 150 mM NaCl, 
2 0mM Tris pH 7.5) for 3 0 min at room temperature. After wash 
20 two times in TBS the sections were incubated with the 

secondary antibody (biotinylated goat anti -rabbit antibodies) 
diluted 1:300 in TBS , followed by two times wash in TBS. The 
sections were stained with streptavidin-biotin complex 
(streptABComplex/AP, Dako) for 3 0 min washed and developed 
2 5 under microscopic inspection with chromagen + new fuchsin ' 

(Vector laboratories) . The sections were counter stained with 
Hematoxylin and analyzed ny microscopy. 

Immuno blotting analysis with hyperimmune monospecific rabbit 
anti -serum 



The insert of pEXl-l clone was amplified by PCR using primers 
containing LIC sites. The PCR product could therefore be 
inserted in the pET-3 2 LIC vector (Novagen, UK cat No. 69076- 
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1) . Thereby the insert sequence of the pEXl-l clone was 
expressed in the new vector as a fusion protein, the part of 
the fusion protein encoded by the pET-3 2 LIC vector had 6 
histidine residues in a row. The expression of the fusion 
5 protein was induced in this vector, and the fusion protein 

could be purified under denaturing condition on a Ni2+ column 
due to the high affinity of the histidine residues to 
divalent cations. The purified protein was used for 
immunization of a New Zealand white rabbit. After 6 times 

10 intramuscular and 2 times intravenous immunization the serum 
was obtained from the rabbit. Purified C. pneumoniae EB was 
dissolved in SDS-sample buffer. Half of the sample was heated 
to 100°C in the sample buffer, whereas the other half of the 
sample was not heated. The samples were separated by 

15 SDS-PAGE, and the proteins were transferred to 

nitrocellulose, the serum was reacted with the strips. With 
the samples heated to 100°C the serum recognized a high 
molecular weight band of approximately 9 8 kDa. This is in 
agreement with the predicted size of Omp5 , of which the 

2 0 pEXl-l clone is a part, however, when the antibody was 

reacted to the strip with unheated EB, the pattern was 
different. Now a band was seen with a size of 75 kDa, in 
addition weaker bands were observed above the band (Figure 
10) . These data demonstrate that Omp5 needs boiling in 
25 SDS-sample buffer to be fully denatured and migrate with a 
size as predicted from the gene product. When the samples 
were not boiled, the protein was not fully denatured and less 
SDS binds to the protein and it has a more globular structure 
that will migrate faster in the acrylamide gel. The band 

3 0 pattern looked identical to what was obtained with a 

monoclonal antibody (MAb 26.1) (lane 6), we earlier have 
described (Christiansen et al . , 1994), reacting with the 
surface of C. pneumoniae EB, but the antibody do not react 
with the fully SDS denatured C. pneumoniae EB in 
3 5 immunoblotting . 
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Experimental infection of C57 black mice 

Due to the realization of the altered migration of the 0mp4-7 
proteins without boiling, we chose to analyse antibodies 
against C. pneumoniae EBs after an experimental infection of 
5 mice. To obtain antibodies from an infection caused by C. 

pneumoniae, C57 black mice were inoculated intranasally with 
10 7 CFI of C. pneumoniae under a light ether anaesthesia. 
After 14 days of infection the serum samples were obtained 
and the lungs were analysed for pathological changes. In two 

10 of the mice a severe pneumonia was observed in the lung 
sections, and in the third mouse only minor changes were 
observed. The serum from the mice was diluted 1:100 and 
reacted with purified EBs dissolved in sample buffer with and 
without boiling. In the preparations that had been heated to 

15 100°C the sera from two of the mice reacted strongly with 
bands of 60/62 kDa and weaker bands of 5 5 kDa, but no 
reaction was observed with proteins of the size of Omp4-7 
(Figure 11) . However, when the sera were reacted with the 
preparation that had not been heated they all had a strong 

20 reaction with a broad band of an approximate size of 75 kDa. 
This is in agreement with the size of the Omp4-7 proteins in 
the unheated preparation. Therefore, it could be concluded 
that the epitopes of the Omp4-7 proteins recognized by the 
antibodies after a C. pneumoniae infection were discontinuous 

2 5 epitopes because the full denaturation of the antigen 
completely destroyed the epitopes. The 75 kDa protein 
observed in unheated samples is not Omp2 (Shown in 
immunoblotting with an 0mp2 specific antibody) 

EXAMPLE 3 
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Comparison of 0mp4-7 of C. pneumoniae with putative outer 
membrane proteins (POMP) of C. psittaci 

Longbottom et al . (1996) have published partial sequence from 
98 to 90 kDa proteins from C. psittaci. They have entered the 
full sequence of 5 genes in this family in the EMBL database. 
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They have named the genes "putative outer membrane proteins" 
(POMP) since their precise location was not determined. The 
family is composed of two genes that are completely 
identical, and two genes with high homology to these genes. 
They calculated a molecular size of 90 and 91 kDa. The 5th 
encode a protein of 9 8 kDa. The sequence of the Omp4-7 
proteins of C. pneumoniae were compared to the sequences of 
the C. Psittaci POMP proteins with the programme pileup in 
the GCG package . The amino acid homologies were in the range 
of 51-63%. It is seen that the C. pneumoniae Omp4-5 proteins 
are most related to the 98 kDa POMP protein of C. psittaci . 
Interestingly, the 98 kDa C. psittaci POMP protein is more 
related to the C. pneumoniae genes than to the other C. 
psittaci genes. The repeated sequences of GGAI were conserved 
15 in the 98 kDa POMP protein, but only three GGAI repeats were 
present in the 9 0 and 91 kDa C. psittaci POMP proteins. For 
C. psittaci it has been shown that antibodies to these 
proteins seem to be protective for the infection. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT 

(A) NAME: Svend Birkelund 

(B) STREET: Dept. of Medical Microbiology and Immunology, 

University of Arhus 

(C) CITY; Arhus C 

(D) STATE OR PROVINCE: 

(E) COUNTRY: Denmark 

(F) POSTAL CODE: 8000 

(ii) TITLE OF THE INVENTION: Chlamydia pneumoniae anci 

gens 

(iii) NUMBER OF SEQUENCES: 3 0 

(iv) COMPUTER -READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER; IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(v) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 205... 2987 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CAATGTCGAA GAGAGCACTA ACCAGGAAAA TTGCGATTTC ATAAACCCAC TTTATTATTA 6 0 

AAT T CTT ACT TG CGT CAT AT AAAATAGAAA ACTCAGAGAG TCAAGATAAA AATTCTTGAC 12 0 

AG CTGTTTTG TCATCTTTAA CTTGATTTAC TTATTTTGTT TCTATATTGA TGCGAATAGT 13 0 

TCTCTAAAAA ACAAAAGCAT TACC ATG AAG ACT TCG ATT CCT TGG GTT TTA 231 

Met Lys Thr Ser lie Pro Trp Val Leu 
1 5 

GTT TCC TCC GTG TTA GCT TTC TCA TGT CAC CTA CAG TCA CTA GCT AAC 27 9 
Val Ser Ser Val Leu Ala Phe Ser Cys His Leu Gin Ser Leu Ala Asn 
10 15 20 25 
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GAG GAA CTT TTA TCA CCT GAT GAT AGC TTT AAT GGA AAT ATC GAT TCA 32 7 

Glu Glu Leu Leu Ser Pro Asp Asp Ser Phe Asn Gly Asn He Asp Ser 
30 35 40 

GGA ACG TTT ACT CCA AAA ACT TCA GCC ACA ACA TAT TCT CTA ACA GGA 375 
Gly Thr Phe Thr Pro Lys Thr Ser Ala Thr Thr Tyr Ser Leu Thr Gly 
45 50 55 

GAT GTC TTC TTT TAC GAG CCT GGA AAA GGC ACT CCC TTA TCT GAC AGT 42 3 

Asp Val Phe Phe Tyr Glu Pro Gly Lys Gly Thr Pro Leu Ser Asp Ser 
60 65 70 

TGT TTT AAG CAA ACC ACG GAC AAT CTT ACC TTC TTG GGG AAC GGT CAT 4 71 

Cys Phe Lys Gin Thr Thr Asp Asn Leu Thr Phe Leu Gly Asn Gly His 
75 80 85 

AGC TTA ACG TTT GGC TTT ATA GAT GCT GGC ACT CAT GCA GGT GCT GCT 519 
Ser Leu Thr Phe Gly Phe He Asp Ala Gly Thr His Ala Gly Ala Ala 
90 95 100 105 

GCA TCT ACA ACA GCA AAT AAG AAT CTT ACC TTC TCA GGG TTT TCC TTA / 56 7 
Ala Ser Thr Thr Ala Asn Lys Asn Leu Thr Phe Ser Gly Phe Ser Leu 
13 -0 115 120 

CTG AGT TTT GAT TCC TCT CCT AGC ACA ACG GTT ACT ACA GGT CAG GGA 615 
Leu Ser Phe Asp Ser Ser Pro Ser Thr Thr Val Thr Thr Gly Gin Gly 
125 130 135 

ACG CTT TCC TCA GCA GGA GGC GTA AAT TTA GAA AAT ATT CGT AAA CTT 663 
Thr Leu Ser Ser Ala Gly Gly Val Asn Leu Glu Asn lie Arg Lys Leu 
140 145 150 

GTA GTT GCT GGG AAT TTT TCT ACT GCA GAT GGT GGA GCT ATC AAA GGA 711 
Val Val Ala Gly Asn Phe Ser Thr Ala Asp Gly Gly Ala He Lys Gly 
155 160 165 

GCG TCT TTC CTT TTA ACT GGC ACT TCT GGA GAT GCT CTT TTT AGT AAC 759 
Ala Ser Phe Leu Leu Thr Gly Thr Ser Gly Asp Ala Leu Phe Ser Asn 
170 175 180 185 

AAC TCT TCA TCA ACA AAG GGA GGA GCA ATT GCT ACT ACA GCA GGC GCT 80 7 

Asn Ser Ser Ser Thr Lys Gly Gly Ala He Ala Thr Thr Ala Gly Ala 
190 195 200 

CGC ATA GCA AAT AAC ACA GGT TAT GTT AGA TTC CTA TCT AAC ATA GCG 85 5 

Arg He Ala Asn Asn Thr Gly Tyr Val Arg Phe Leu Ser Asn He Ala 
205 210 215 

TCT ACG TCA GGA GGC GCT ATC GAT GAT GAA GGC ACG TCG ATA CTA TCG 90 3 

Ser Thr Ser Gly Gly Ala He Asp Asp Glu Gly Thr Ser He Leu Ser 
220 225 230 

AAC AAC AAA TTT CTA TAT TTT GAA GGG AAT GCA GCG AAA ACT ACT GGC 951 
Asn Asn Lys Phe Leu Tyr Phe Glu Gly Asn Ala Ala Lys Thr Thr Gly 
235 240 245 

GGT GCG ATC TGC AAC ACC AAG GCG AGT GGA TCT CCT GAA CTG ATA ATC 999 
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Gly Ala lie Cys Asn Thr Lys Ala Ser Gly Ser Pro Glu Leu He He 
250 255 260 265 

TCT AAC AAT AAG ACT CTG ATC TTT GCT TCA AAC GTA GCA GAA ACA AGC 104 7 

Ser Asn Asn Lys Thr Leu He Phe Ala Ser Asn Val Ala Glu Thr Ser 
270 275 280 

GGT GGC GCC ATC CAT GCT AAA AAG CTA GCC CTT TCC TCT GGA GGC TTT 10 9 5 

Gly Gly Ala He His Ala Lys Lys Leu Ala Leu Ser Ser Gly Gly Phe 
285 290 295 

ACA GAG TTT CTA CGA AAT AAT GTC TCA TCA GCA ACT CCT AAG GGG GGT 1143 
Thr Glu Phe Leu Arg Asn Asn Val Ser Ser Ala Thr Pro Lys Gly Gly 
300 305 310 

GCT ATC AGC ATC GAT GCC TCA GGA GAG CTC AGT CTT TCT GCA GAG ACA 1191 
Ala He Ser He Asp Ala Ser Gly Glu Leu Ser Leu Ser Ala Glu Thr 

320 325 

GGA AAC ATT ACC TTT GTA AGA AAT ACC CTT ACA ACA ACC GGA AGT ACC 12 3 9 

Gly Asn He Thr Phe Val Arg Asn Thr Leu Thr Thr Thr Gly Ser Thr 
330 335 340 345 

GAT ACT CCT AAA CGT AAT GCG ATC AAC ATA GGA AGT AAC GGG AAA TTC 12 8 7 

Asp Thr Pro Lys Arg Asn Ala He Asn He Gly Ser Asn Gly Lys Phe 
350 355 360 

ACG GAA TTA CGG GCT GCT AAA AAT CAT ACA ATT TTC TTC TAT GAT CCC 13 3 5 

Thr Glu Leu Arg Ala Ala Lys Asn His Thr He Phe Phe Tyr Asp Pro 
365 370 375 

ATC ACT TCA GAA GGA ACC TCA TCA GAC GTA TTG AAG ATA AAT AAC GGC 13 8 3 

He Thr Ser Glu Gly Thr Ser Ser Asp Val Leu Lys He Asn Asn Gly 
380 385 390 

TCT GCG GGA GCT CTC AAT CCA TAT CAA GGA ACG ATT CTA TTT TCT GGA 14 31 

Ser Ala Gly Ala Leu Asn Pro Tyr Gin Gly Thr He Leu Phe Ser Gly 
395> 400 405 

GAA ACC CTA ACA GCA GAT GAA CTT AAA GTT GCT GAC AAT TTA AAA TCT 14 7 9 

Glu Thr Leu Thr Ala Asp Glu Leu Lys Val Ala Asp Asn Leu Lys Ser 
410 415 420 425 

TCA TTC ACG CAG CCA GTC TCC CTA TCC GGA GGA AAG TTA TTG CTA CAA 152 7 

Ser Phe Thr Gin Pro Val Ser Leu Ser Gly Gly Lys Leu Leu Leu Gin 
430 435 440 

AAG GGA GTC ACT TTA GAG AGC ACG AGC TTC TCT CAA GAG GCC GGT TCT 157 5 

Lys Gly Val Thr Leu Glu Ser Thr Ser Phe Ser Gin Glu Ala Gly Ser 
445 450 455 

CTC CTC GGC ATG GAT TCA GGA ACG ACA TTA TCA ACT ACA GCT GGG AGT 1623 
Leu Leu Gly Met Asp Ser Gly Thr Thr Leu Ser Thr Thr Ala Gly Ser 
460 465 470 

ATT ACA ATC ACG AAC CTA GGA ATC AAT GTT GAC TCC TTA GGT CTT AAG 16 71 

He Thr He Thr Asn Leu Gly He Asn Val Asp Ser Leu Gly Leu Lys 
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47 5 480 485 

CAG CCC GTC AGC CTA ACA GCA AAA GGT GCT TCA AAT AAA GTG ATC GTA 17X9 
Gin Pro Val Ser Leu Thr Ala Lys Gly Ala Ser Asn Lvs Val lie Val 
490 495 500 * 505 

TCT GGG AAG CTC AAC CTG ATT GAT ATT GAA GGG AAC ATT TAT GAA AGT 17 67 

Ser Gly Lys Leu Asn Leu lie Asp He Glu Gly Asn He Tyr Glu Ser 
510 515 520 

CAT ATG TTC AGC CAT GAC CAG CTC TTC TCT CTA TTA AAA ATC ACG GTT 181S 
His Met Phe Ser His Asp Gin Leu Phe Ser Leu Leu Lys lie Thr Val 
525 530 535 

GAT GCT GAT GTT GAT ACT AAC GTT GAC ATC AGC AGC CTT ATC CCT GTT 186 3 

Asp Ala Asp Val Asp Thr Asn Val Asp He Ser Ser Leu He Pro Val 
540 545 550 

CCT GCT GAG GAT CCT AAT TCA GAA TAC GGA TTC CAA GGA CAA TGG AAT 1911 
Pro Ala Glu Asp Pro Asn Ser Glu Tyr Gly Phe Gin Gly Gin Trp Asn 
555 560 565 

GTT AAT TGG ACT ACG GAT ACA GCT ACA AAT ACA AAA GAG GCC ACG GCA 1959 
Val Asn Trp Thr Thr Asp Thr Ala Thr Asn Thr Lys Glu Ala Thr Ala 
570 575 580 585 

ACT TGG ACC AAA ACA GGA TTT GTT CCC AGC CCC GAA AGA AAA TCT GCG 2 007 
Thr Trp Thr Lys Thr Gly Phe Val Pro Ser Pro Glu Arg Lys Ser Ala 
590 595 600 

TTA GTA TGC AAT ACC CTA TGG GGA GTC TTT ACT GAC ATT CGC TCT CTG 2 0 55 
Leu Val Cys Asn Thr Leu Trp Gly Val Phe Thr Asp He Arg Ser Leu 
605 610 615 

CAA CAG CTT GTA GAG ATC GGC GCA ACT GGT ATG GAA CAC AAA CAA GGT 210 3 
Gin Gin Leu Val Glu He Gly Ala Thr Gly Met Glu His Lys Gin Gly 
620 625 630 

TTC TGG GTT TCC TCC ATG ACG AAC TTC CTG CAT AAG ACT GGA GAT GAA 2151 
Phe Trp Val Ser Ser Met Thr Asn Phe Leu His Lys Thr Gly Asp Glu 
635 640 645 

AAT CGC AAA GGC TTC CGT CAT ACC TCT GGA GGC TAC GTC ATC GGT GGA 2199 
Asn Arg Lys Gly Phe Arg His Thr Ser Gly Gly Tyr Val II- Gly Gly 
650 S55 660 665 

AGT GCT CAC ACT CCT AAA GAC GAC CTA TTT ACC TTT GCG TTC TGC CAT 224 7 
Ser Ala His Thr Pro Lys Asp Asp Leu Phe Thr Phe Ala Phe Cys His 
670 675 680 

CTC TTT GCT AGA GAC AAA GAT TGT TTT ATC GCT CAC AAC AAC TCT AGA 2 2 95 
Leu Phe Ala Arg Asp Lys Asp Cys Phe He Ala His Asn Asn Ser Arg 
685 690 695 

ACC TAC GGT GGA ACT TTA TTC TTC AAG CAC TCT CAT ACC CTA CAA CCC 2 34 3 
Thr Tyr Gly Gly Thr Leu Phe Phe Lys His Ser His Thr Leu Gin Pro 
700 705 710 
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CAA AAC TAT TTG AGA TTA GGA AGA GCA AAG TTT TCT GAA TCA GCT ATA 23 91 

Gin Asn Tyr Leu Arg Leu Gly Arg Ala Lys Phe Ser Glu Ser Ala lie 
715 * 720 725 

GAA AAA TTC CCT AGG GAA ATT CCC CTA GCC TTG GAT GTC CAA GTT TCG 24 3 9 

Glu Lys Phe Pro Arg Glu lie Pro Leu Ala Leu Asp Val Gin Val Ser 
730 735 740 745 

TTC AGC CAT TCA GAC AAC CGT ATG GAA ACG CAC TAT ACC TCA TTG CCA 24 37 

Phe Ser His Ser Asp Asn Arg Met Glu Thr His Tyr Thr Ser Leu Pro 
750 755 760 

GAA TCC GAA GGT TCT TGG AGC AAC GAG TGT ATA GCT GGT GGT ATC GGC 2 53 5 

Glu Ser Glu Gly Ser Trp Ser Asn Glu Cys lie Ala Gly Gly lie Gly 
765 770 * 775 

CTA GAC CTT CCT TTT GTT CTT TCC AAC CCA CAT CCT CTT TTC AAG ACC 2 58 3 

Leu Asp Leu Pro Phe Val Leu Ser Asn Pro His Pro Leu Phe Lys Thr 
780 785 790 

TTC ATT CCA CAG ATG AAA GTC GAA ATG GTT TAT GTA TCA CAA AAT AGC 26 3 1 

Phe lie Pro Gin Met Lys Val Glu Met Val Tyr Val Ser Gin Asn Ser 
795 800 805 

TTC TTC GAA AGC TCT AGT GAT GGC CGT GGT TTT AGT ATT GGA AGG CTG 26 7 9 

Phe Phe Glu Ser Ser Ser Asp Gly Arg Gly Phe Ser lie Gly Arg Leu 
810 815 820 825 

CTT AAC CTC TCG ATT CCT GTG GGT GCG AAA TTC GTG CAG GGG GAT ATC 2 72 7 

Leu Asn Leu Ser lie Pro Val Gly Ala Lys Phe Val Gin Gly Asp lie 
830 835 840 

GGA GAT TCC TAC ACC TAT GAT CTC TCA GGA TTC TTT GTT TCC GAT GTC 277 5 

Gly Asp Ser Tyr Thr Tyr Asp Leu Ser Gly Phe Phe Val Ser Asp Val 
845 850 855 

TAT CGT AAC AAT CCC CAA TCT ACA GCG ACT CTT GTG ATG AGC CCA GAC 2 82 3 

Tyr Arg Asn Asn Pro Gin Ser Thr Ala Thr Leu Val Met Ser Pro Asp 
860 865 870 

TCT TGG AAA ATT CGC GGT GGC AAT CTT TCA AGA CAG GCA TTT TTA CTG 2 871 

Ser Trp Lys lie Arg Gly Gly Asn Leu Ser Arg Gin Ala Phe Leu Leu 
875 880 885 

AGG GGT AGC AAC AAC TAC GTC TAC AAC TCC AAT TGT GAG CTC TTC GGA 2 919 

Arg Gly Ser Asn Asn Tyr Val Tyr Asn Ser Asn Cys Glu Leu Phe Gly 
890 895 900 905 

CAT TAC GCT ATG GAA CTC CGT GGA TCT TCA AGG AAC TAC AAT GTA GAT 2 96 7 

His Tyr Ala Met Glu Leu Arg Gly Ser Ser Arg Asn Tyr Asn Val Asp 
910 915 920 

GTT GGT ACC AAA CTC CGA TT CTAGATTGCT AAAACTCCCT AGTTCTTCTA GGGAG 3 02 2 
Val Gly Thr Lys Leu Arg Phe 
925 

TTTT CT CAT A CTTTTAGGGA AATATTTGCT ATAGGGAATG CTTTCCTTGC AAACTGTAAA 3 0 8?. 
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40 

AAATAACATT TGTCCCTCTT CAAAAAAGAT TTCTTTTAAT AATTTCTAGT TATAATTTTA 3142 

TTTTAAAAAC AGTTAAATAA TTAATAGACA ATAATCTATT CTTATTGACT TCTTTTTT 32 00 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Lys Thr Ser He Pro Trp Val Leu Val Ser Ser Val Leu Ala Phe 

15 10 15 

Ser Cys His Leu Gin Ser Leu Ala Asn Glu Glu Leu Leu Ser Pro Asp 

20 25 30 

Asp Ser Phe Asn Gly Asn He Asp Ser Gly Thr Phe Thr Pro Lys Thr 

35 40 45 

Ser Ala Thr Thr Tyr Ser Leu Thr Gly Asp Val Phe Phe Tyr Glu Pro 

50 55 60 

Gly Lys Gly Thr Pro Leu Ser Asp Ser Cys Phe Lys Gin Thr Thr Asp 
65 7 ° 75 80 

Asn Leu Thr Phe Leu Gly Asn Gly His Ser Leu Thr Phe Gly Phe He 

85 90 95 

Asp Ala Gly Thr His Ala Gly Ala Ala Ala Ser Thr Thr Ala Asn Lys 

100 105 110 

Asn Leu Thr Phe Ser Gly Phe Ser Leu Leu Ser Phe Asp Ser Ser Pro 

115 120 125 

Ser Thr Thr Val Thr Thr Gly Gin Gly Thr Leu Ser Ser Ala Gly Gly 

130 135 140 

Val Asn Leu Glu Asn He Arg Lys Leu Val Val Ala Gly Asn Phe Ser 
X " > 150 155 160 

Thr Ala Asp Gly Gly Ala He Lys Gly Ala Ser Phe Leu Leu Thr Gly 

165 170 175 

Thr Ser Gly Asp Ala Leu Phe Ser Asn Asn Ser Ser Ser Thr Lys Gly 

180 185 190 

Gly Ala He Ala Thr Thr Ala Gly Ala Arg He Ala Asn Asn Thr Gly 

195 200 205 

Tyr Val Arg Phe Leu Ser Asn He Ala Ser Thr Ser Gly Gly Ala He 

210 215 220 

Asp Asp Glu Gly Thr Ser He Leu Ser Asn Asn Lys Phe L»u Tyr Phe 
225 230 235 240 

Glu Gly Asn Ala Ala Lys Thr Thr Gly Gly Ala He Cys Asn Thr Lys 

245 250 255 

Ala Ser Gly Ser Pro Glu Leu He He Ser Asn Asn Lys Thr Leu He 

250 265 270 

Phe Ala Ser Asn Val Ala Glu Thr Ser Gly Gly Ala He His Ala Lys 

275 280 285 

Lys Leu Ala Leu Ser Ser Gly Gly Phe Thr Glu Phe Leu Arg Asn Asn 

29 0 295 300 

Val Ser Ser Ala Thr Pro Lys Gly Gly Ala He Ser He Asp Ala Ser 
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Ser 


Glu 
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Trp 


Ser 
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765 
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Asn Glu Cys He Ala Gly Gly He Gly Leu Asp Leu Pro Phe Val Leu 

770 775 780 

Ser Asn Pro His Pro Leu Phe Lys Thr Phe He Pro Gin Met Lys Val 
785 790 795 800 

Glu Met Val Tyr Val Ser Gin Asn Ser Phe Phe Glu Ser Ser Ser Asp 

805 810 815 

Gly Arg Gly Phe Ser He Gly Arg Leu Leu Asn Leu Ser He Pro Val 

820 825 830 

Gly Ala Lys Phe Val Gin Gly Asp He Gly Asp Ser Tyr Thr Tyr Asp 

835 840 845 

Leu Ser Gly Phe Phe Val Ser Asp Val Tyr Arg Asn Asn Pro Gin Ser 

850 855 860 

Thr Ala Thr Leu Val Met Ser Pro Asp Ser Trp Lys He Arg Gly Gly 
865 870 875 880 

Asn Leu Ser Arg Gin Ala Phe Leu Leu Arg Gly Ser Asn Asn Tyr Val 

885 890 895 

Tyr Asn Ser Asn Cys Glu Leu Phe Gly His Tyr Ala Met Glu Leu Arg 

900 905 910 

Gly Ser Ser Arg Asn Tyr Asn Val Asp Val Gly Thr Lys Leu Arg Phe 
915 920 925 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATGAAATCGC AATTTTCCTG GTTAGTG CTC TCTTCGACAT TGGCATGTTT TACTAGTTGT 60 

TCCACTGTTT TTG CTGCAAC TGCTGAAAAT ATAGGCCCCT CTG AT AG CTT TGACGGAAGT 120 

ACTAACACAG GCACCTATAC TCCTAAAAAT ACGACTACTG GAATAGACTA TACTCTGACA 180 

GGAGATATAA CTCTGCAAAA CCTTGGGGAT TCGGCAGCTT TAACGAAGGG TTGTTTTTCT 24 0 

GACACTACGG AATCTTTAAG CTTTGCCGGT AAGGGGTACT CACTTTCTTT TTTAAATATT 3 00 

AAGTCTAGTG CTGAAGGCGC AGCACTTTCT GTTACAACTG ATAAAAATCT GTCGCTAACA 360 

GGATTTTCGA GTCTTACTTT CTTAGCGGCC CCATCATCGG TAATCACAAC CCCCTCAGGA 420 

AAAGGTGCAG TTAAATGTGG AGGGGATCTT ACATTTGATA ACAATGGAAC TATTTTATTT 4 80 

AAACAAGATT ACTGTGAGGA AAATGGCGGA GCCATTTCTA CCAAGAATCT TTCTTTGAAA 54 0 

AACAGCACGG GATCGATTTC TTTTGAAGGG AATAAATCGA GCGCAACAGG GAAAAAAGGT 600 
GGGGCTATTT GTGCTACTGG TACTGTAGAT ATTACAAATA ATACGGCTCC TACCCTCTTC 6 60 

TCGAACAATA TTGCTGAAGC TGCAGGTGGA GCTATAAATA G CAC AGG AAA CTGTACAATT 720 
ACAGGGAATA CGTCTCTTGT ATTTTCTGAA AAT AGTGTG A CAGCGACCGC AGGAAATGGA 7 80 

GGAGCTCTTT CTGGAGATGC CGATGTTACC AT AT CTG GG A AT CAG AGTGT AACTTTCTCA 84 0 

GGAAACCAAG CTGTAGCTAA TGGCGGAGCC ATTTATGCTA AGAAGCTTAC ACTGGCTTCC 900 
GGGGGGGGGG GGGGTATCTC CTTTT CTAAC AAT ATAGTC C AAGGT AC CAC TGCAGGTAAT 96 0 

GGTGGAGCCA TTTCTATACT GGCAGCTGGA GAGTGTAGTC TTTCAGCAGA AG CAGGGG AC 1020 

ATT AC CTT C A ATGGGAATGC CATTGTTGCA ACTACACCAC AAACTACAAA AAGAAATTCT 10 80 

ATTGACATAG GATCTACTGC AAAGAT CACG AATTTACGTG CAATATCTGG GCATAGCATC 114 0 

TTTTTCTACG ATCCGATTAC TGCTAATACG GCTGCGGATT CTACAGATAC TTTAAATCTC 12 0 0 

AATAAGGCTG ATGCAGGTAA TAGTACAGAT TATAGTGGGT CGATTGTTTT TTCTGGTGAA 12 6 0 
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AAGCTCTCTG AAGATGAAGC AAAAGTTGCA GACAACCTCA CTTCTACGCT GAAGCAGCCT 1320 

GTAACTCTAA CTGCAGGAAA TTTAGTACTT AAACGTGGTG TCACTCTCGA TACGAAAGGC 13 80 

TTT ACT CAG A CCGCGGGTTC CTCTGTTATT ATGGATGCGG G C AC AACGTT AAAAGCAAGT 14 4 0 

ACAGAGGAGG TCACTTTAAC AGGTCTTTCC ATTCCTGTAG ACTCTTTAGG CGAGGGTAAG 1500 

AAAGTTGTAA TTGCTGCTTC TGCAGCAAGT AAAAATGTAG CCCTTAGTGG TCCGATTCTT 1560 

CTTTTGGATA ACCAAGGGAA TGCTTATGAA AATCACGACT TAGGAAAAAC TCAAGACTTT 1620 

TCATTTGTGC AGCTCTCTGC TCTGGGTACT GCAACAACTA CAGATGTTCC AGCGGTTCCT 16 80 

ACAGTAGCAA CTCCTACGCA CTATGGGTAT CAAGGTACTT GGGGAATGAC TTGGGTTGAT 174 0 

GATACCGCAA GCACTCCAAA GACTAAGACA GCGACATTAG CTTGGACCAA TACAGGCTAC 1800 

CTTCCGAATC CTGAGCGTCA AGGACCTTTA GTTCCTAATA GCCTTTGGGG AT CTTTTT C A 18 60 

GACATCCAAG CGATTCAAGG TGTCATAGAG AGAAGTGCTT TGACTCTTTG TT CAGATCG A 1920 

GGCTTCTGGG CTGCGGGAGT CGCCAATTTC TTAGATAAAG ATAAGAAAGG GGAAAAACGC 19 80 

AAATACCGTC ATAAATCTGG TGGATATGCT ATCGGAGGTG CAGCGCAAAC TTGTTCTGAA 2 04 0 

AACTTAATTA GCTTTGCCTT TTGCCAACTC TTTGGTAGCG ATAAAGATTT CTTAGTCGCT 2100 

AAAAATCATA CTGATACCTA TGCAGGAGCC TTCTATATCC AACACATTAC AGAATGTAGT 2160 

GGGTT CAT AG GTTGTCTCTT AGATAAACTT CCTGGCTCTT GGAGTCATAA ACCCCTCGTT 222 0 

TTAGAAGGGC AGCTCGCTTA TAGCCACGTC AGTAATGATC TGAAGACAAA. GTATACTGCG 2 2 80 

TATCCTGAGG TGAAAGGTTC TTGGGGGAAT AATGCTTTTA ACATGATGTT GGGAGCTTCT 2 3 40 

TCTCATTCTT ATCCTGAATA CCTGCATTGT TTTGATACCT ATGCTCCATA CATCAAACTG 2 4 00 

AATCTGACCT ATATACGTCA GGACAGCTTC TCGGAGAAAG GTACAGAAGG AAG AT CTTTT 2 4 60 

G ATG AC AG C A ACCTCTTCAA TTTATCTTTG CCTATAGGGG TGAAGTTTGA GAAGTTCTCT 2 52 0 

GATTGTAATG ACTTTTCTTA TGATCTGACT TTATCCTATG TTCCTGATCT TATCCGCAAT 2 5 80 

GATCCCAAAT GCACTACAGC ACTTGTAATC AGCGGAGCCT CTTGGGAAAC TTATGCCAAT 2 640 

AACTTAGCAC GACAGGCCTT GCAAGTGCGT GCAGGCAGTC ACTACGCCTT CTCTC CTATG 2 700 

TTTGAAGTG C TCGGCCAGTT TGTCTTTGAA GTT CGTGG AT CCTCACGGAT TTATAATGTA 2 760 

GAT CTTGGGG GTAAGTTCCA ATTCTAGGAG CGTCTCTCAT GTCTCAGAAA TTCTG 2 815 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 928 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

* (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Lys Ser Gin Phe Ser Trp Leu Val Leu Ser Ser Thr Leu Ala Cys 

1 5 10 15 

Phe Thr Ser Cys Ser Thr Val Phe Ala Ala Thr Ala Glu Asn lie Gly 

20 25 30 * 

Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr Gly Thr Tyr Thr Pro 

35 40 45 

Lys Asn Thr Thr Thr Gly He Asp Tyr Thr Leu Thr Gly Asp He Thr 

50 55 60 

Leu Gin Asn Leu Gly Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser 
65 70 75 80 

Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser 

85 90 * 95 

Phe Leu Asn He Lys Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr 

100 105 110 

Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu 

115 120 125 

Ala Ala Pro Ser Ser Val lie Thr Thr Pro Ser Gly Lys Gly Ala Val 
130 135 140 
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Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn Gly Thr lie Leu Phe 
145 150 155 160 

Lys Gin Asp Tyr Cys Glu Glu Asn Gly Gly Ala He Ser Thr Lys Asn 

1^5 170 175 

Leu Ser Leu Lys Asn Ser Thr Gly Ser He Ser Phe Glu Gly Asn Lys 

ISO 185 190 

Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala He Cys Ala Thr Gly Thr 

195 200 205 

Val Asp He Thr Asn Asn Thr Ala Pro Thr Leu Phe Ser Asn Asn He 

21° 215 220 

Ala Glu Ala Ala Gly Gly Ala He Asn Ser Thr Gly Asn Cys Thr He 
225 230 235 240 

Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn Ser Val Thr Ala Thr 

245 250 255 

Ala Gly Asn Gly Gly Ala Leu Ser Gly Asp Ala Asp Val Thr He Ser 

260 265 270 

Gly Asn Gin Ser Val Thr Phe Ser Gly Asn Gin Ala Val Ala Asn Gly 

275 280 285 

Gly Ala He Tyr Ala Lys Lys Leu Thr Leu Ala Ser Gly Gly Gly Gly 

2 90 295 300 

Gly He Ser Phe Ser Asn Asn He Val Gin Gly Thr Thr Ala Gly Asn 
305 310 315 320 

Gly Gly Ala He Ser He Leu Ala Ala Gly Glu Cys Ser Leu Ser Ala 

325 330 335 

Glu Ala Gly Asp He Thr Phe Asn Gly Asn Ala He Val Ala Thr Thr 

340 345 350 

Pro Gin Thr Thr Lys Arg Asn Ser He Asp He Gly Ser Thr Ala Lys 

355 360 365 

He Thr Asn Leu Arg Ala He Ser Gly His Ser He Phe Phe Tyr Asp 

370 375 380 

Pro He Thr Ala Asn Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu 
385 390 395 400 

Asn Lys Ala Asp Ala Gly Asn Ser Thr Asp Tyr Ser Gly Ser He Val 

4 05 410 415 

Phe Ser Gly Glu - Lys Leu Ser Glu Asp Glu' Ala Lys Val Ala Asp Asn 

420 425 430 

Leu Thr Ser Thr Leu Lys Gin Pro Val Thr Leu Thr Ala Gly Asn Leu 

\ 435 440 445 

Val Leu Lys Arg Gly Val Thr Leu Asp Thr Lys Gly Phe Thr Gin Thr 

4 50 455 460 

Ala Gly Ser Ser Val He Met Asp Ala Gly Thr Thr Leu Lys Ala Ser 
465 470 475 480 

Thr Glu Glu Val Thr Leu Thr Gly Leu Ser He Pro Val Asp Ser Leu 

485 490 495 

Gly Glu Gly Lys Lys Val Val He Ala Ala Ser Ala Ala Ser Lys Asn 

500 505 510 

Val Ala Leu Ser Gly Pro He Leu Leu Leu Asp Asn Gin Gly Asn Ala 

515 520 525 

Tyr Glu Asn His Asp Leu Gly Lys Thr Gin Asp Phe Ser Phe Val Gin 

530 535 540 

Leu Ser Ala Leu Gly Thr Ala Thr Thr Thr Asp Val Pro Ala Val Pro 
545 550 555 560 

Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gin Gly Thr Trp Gly Met 

565 570 575 

Thr Trp Val Asp Asp Thr Ala Ser Thr Pro Lys Thr Lys Thr Ala Thr 

580 585 590 

Leu Ala Trp Thr Asn Thr Gly Tyr Leu Pro Asn Pro Glu Arg Gin Gly 
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(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3052 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATGCGATTTT CGCT CTGCGG ATTTCCTCTA GTTTTTTCTT TAACATTGCT CTCAGTCTTC 60 
GACACTTCTT TGAGTGCTAC TACGATTTCT TTAACCCCAG AAGATAGTTT TCATGGAGAT 120 
AGTCAGAATG CAGAACGTTC TTATAATGTT CAAGCTGGGG ATGTCTATAG CCTTACTGGT 18 0 
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CAATGTGACC 


24 0 


TATTT C CT C A 


3 00 


GGCACGTTTT 


3 60 


ACAGGGATGT 


*± u 


TGAACAAAAC 


4 80 


AGGCAACTAC 


U 


TT CTT CAGGT 


con 

o u u 


TG C CAAG A AT 


fin 
o o u 


TG CTT ATGTT 


7 9 Pi 


TGTCTGTTC5T 

X X V. X \J X. X \J x 


7 ft n 


CAATAAAC" AP4 


O •** U 


TAGGAAAPTT 






c* c pi 


AG cag ac;zv a a 


i n "5 n 


WM^UrtiLi X 


i Aon 


AHAATTTTaT 
rtUnn x X X x.r\x 


1 1 


AfiATPrrA aa 


i onn 

LZ. V U 




i t c n 
IZfaU 


V- X X J-v<j 1 x A 1 1 


1 ^ "1 A 

lj 2 0 


W V— +\ Hi 1 1 






1*4 4 U 


a A apa crTT^Ti 


i c n n 


PA iTf!P f ""P A T" 


i c £ n 
lb o 0 


w V- X Lj w ALj v_ L. 


162 0 


TT ATP2 fZ f T H T r VT > 
x X Aluu X XXX 


t c o r\ 
lb OU 


aTTPTTPTPP 
■rvx x v_ x 1^,1 LjLj 


1/40 


T A. 7\T2\T r»!Mi»p 
X ftAiHl Li llj 


1800 


TPI P A T Pf* n /"* r* 


1860 






TPT A 21 ATfl at 


1980 


X rtUMurtV — 




PTATPAATAT 


z 1UU 


AAAPHTPPPP 


•)1 en 
ZloU 


P TPTP, PTT a T 


2 U 


HAAAAAPAPP 


O O £5 Pi 


RfJTATTTnAP. 




TTATfJPTTAT 


O /i n 


CI APTTT A A P A 


o a. a n 


PifiATPTaPTP 


T con 


ATRTP, aar, rr 


o c o n 


AARAPATPPT 




CTTATGTCGA 


2700 


AAGCAAATTT 


2760 


ATCCTGGACT 


2820 


TTTATATGCA 


2880 


CTTTATTTGT 


2940 


TCCGAGGAGT 


3000 


CA 


3052 



(2) INFORMATION FOR SEQ ID NO : 6 ; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 922 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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4 7 

(ii) MOLECULE TYPE: peptide 

Cxi) SEQUENCE DESCRIPTION : SEQ ID NO : 6 : 

Met Arg Phe Ser Leu Cys Gly Phe Pro Leu Val Phe Ser Leu Thr Leu 

15 10 15 

Leu Ser Val Phe Asp Thr Ser Leu Ser Ala Thr Thr He Ser Leu Thr 

20 25 30 

Pro Glu Asp Ser Phe His Gly Asp Ser Gin Asn Ala Glu Arg Ser Tyr 

35 40 45 

Asn Val Gin Ala Gly Asp Val Tyr Ser Leu Thr Gly Asp Val Ser He 

50 55 60 

Ser Asn Val Asp Asn Ser Ala Leu Asn Lys Ala Cys Phe Asn Val Thr 
65 7 ° 75 80 

Ser Gly Ser Val Thr Phe Ala Gly Asn His His Gly Leu Tyr Phe Asn 

85 90 95 

Asn He Ser Ser Gly Thr Thr Lys Glu Gly Ala Val Leu Cys Cys Gin 

10 ° 105 no 

Asp Pro Gin Ala Thr Ala Arg Phe Ser Gly Phe Ser Thr Leu Ser Phe 

115 120 125 

He Gin Ser Pro Gly Asp He Lys Glu Gin Gly Cys Leu Tyr Ser Lys 

I 30 135 140 

Asn Ala Leu Met Leu Leu Asn Asn Tyr Val Val Arg Phe Glu Gin Asn 
145 150 155 160 

Gin Ser Lys Thr Lys Gly Gly Ala He Ser Gly Ala Asn Val Thr He 

lfi 5 170 175 

Val Gly Asn Tyr Asp Ser Val Ser Phe Tyr Gin Asn Ala Ala Thr Phe 

180 185 190 

Gly Gly Ala He His Ser Ser Gly Pro Leu Gin He Ala Val Asn Gin 

195 200 205 

Ala Glu He Arg Phe Ala Gin Asn Thr Ala Lys Asn Gly Ser Gly Glv 

210 215 22 0 

Ala Leu Tyr Ser Asp Gly Asp He Asp He Asp Gin Asn Ala Tyr Val 
225 2 30 235 240 

Leu Phe Arg Glu Asn Glu Ala Leu Thr Thr Ala He Gly Lys Gly Gly 

245 250 255 

Ala Val Cys Cys Leu Pro Thr Ser Gly Ser Ser Thr Pro Val Pro He 

1 ' 260 265 270 

Val Thr Phe Ser Asp Asn Lys Gin Leu Val Phe Glu Arg Asn His Ser 

275 280 285 

He Met Gly Gly Gly Ala He Tyr Ala Arg Lys Leu Ser He Ser Ser 

2 90 295 300 

Gly Gly Pro Thr Leu Phe He Asn Asn He Ser Tyr Ala Asn Ser Gin 
305 310 315 320 

Asn Leu Gly Gly Ala He Ala He Asp Thr Gly Gly Glu He Ser Leu 

325 330 335 

Ser Ala Glu Lys Gly Thr He Thr Phe Gin Gly Asn Arg Thr Ser Leu 

340 345 350 

Pro Phe Leu Asn Gly He His Leu Leu Gin Asn Ala Lys Phe Leu Lys 

355 360 365 

Leu Gin Ala Arg Asn Gly Cys Ser He Glu Phe Tyr Asp Pro He Thr 

370 375 380 

Ser Glu Ala Asp Gly Ser Thr Gin Leu Asn He Asn Gly Asp Pro Lys 
385 395 400 

Asn Lys Glu Tyr Thr Gly Thr He Leu Phe Ser Gly Glu Lys Ser Leu 

405 410 415 

Ala Asn Asp Pro Arg Asp Phe Lys Ser Thr He Pro Gin Asn Val Asn 
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420 

Leu Ser Ala Gly Tyr 
435 

Ser Lys Phe Thr Gin 
450 

Thr Lys Leu lie Ala 
465 

lie Asp lie Asp Ser 
485 

Ala Asn Thr Ala Asn 
500 

lie Ser Pro Thr Gly 
515 

Gin Thr Phe Pro Leu 
530 

Thr Val Thr Ala Gly 
545 

Gin Gly Asn Trp Lys 
565 

Glu Phe Phe Trp Asp 
580 

Gly Asn Leu Val Pro 
595 

Ser Leu Met Gin Val 
610 

Arg Gly Leu Trp lie 
625 

Ser Glu Asp Asn lie 
645 

Ser Val Asn Asn Glu 
660 

Ser Gin Leu Phe Ser 
675 

Tyr Arg Met Tyr Leu 
690 

Gly Asn lie Phe Arg 
705 

lie Leu Ser Arg Arg 
725 

Leu Cys Ala Tyr Gly 
740 

Asn Phe Pro Met Val 
755 

Glu Cys Gly Gly Ser 
770 

Phe Gin Gly Ala He 
785 

Gin Gly Asp Phe Lys 
805 

Gly Ser Leu Thr Ser 
820 

Leu Ala Leu Ser Gin 
835 

Pro Asp He Phe Arg 
850 

Ser Gly Asp Ser Trp 
365 



425 

Leu Val He Lys Glu 
440 

Ser Pro Gly Ser His 
455 

Ser Lys Glu Asp He 
470 

Leu Ser Ser Ser Ser 
490 

Lys Gin He Ser Val 
505 

Asn Ala Tyr Glu Asp 
520 

Leu Ser Leu Glu Pro 
535 

Asp Phe Leu Pro Val 
550 

Leu Ala Trp Thr Gly 
570 

Lys He Asn Tyr Lys 
585 

Asn lie Leu Trp Gly 
600 

Gin Glu Thr His Ala 
615 

Asp Gly He Gly Asn 
630 

Arg Tyr Arg His Asn 
650 

He Thr Pro Lys His 
665 

Arg Asp Lys Asp Tyr 
680 

Gly Ser Tyr Leu Tyr 
695 

Tyr Ala Ser Arg Asn 
710 

Phe Leu Gin Asn Pro 
730 

His Ala Thr Asn Asp 
745 

Lys Asn Ser Trp Arg 
760 

Met Pro Leu Leu Val 
775 

Pro Phe Met Lys Leu 
790 

Glu Thr Thr Ala Asp 
810 

He Ser Val Pro Leu 
825 

Asp Val Leu Tyr Asp 
840 

Lys Asp Pro Ser Cys 
855 

Leu Val Pro Ala Ala 
870 



430 



Gly Ala 


Glu 


Val 


Thr 


Val 






445 








Leu 


Val 


Leu 


Asp 


Leu 


Gly 




460 










Ala 


He 


Thr 


Gly 


Leu 


Ala 


475 










480 


Thr 


Ala 


Ala 


Val 


He 


Lvs 










495 




Thr 


Asp 


Ser 


He 


Glu 


Leu 








510 






Leu 


Ara 


Met 


Arg 


Asn 


Ser 






525 








Gly 


Ala 


Gly Gly 


Ser 


val 




540 










Ser 


Pro 


nis 


Tyr 


Glv 


Phe 


555 










_J O v 


Thr 


Gly 


Asn 


Lys 


Val 


Gly 










575 




Pro 


Arcj 


Pro 


CjIU 


Lvs 


Glu 








C Q A 






Asn 


Ala 


va l 


Asn 


Val 


Arg 






n ^ 

O U J 








Ser 


Ser 


Leu 


Gin 


Thr 


Asp 




620 










Phe 


Phe 


nib 


v a i. 


Ser 


Ala 


635 










640 


Ser 


Gly 


Gly 


Tyr 


Val 


Leu 










655 




Tyr 


Thr 


Ser 




Ala 


Phe 








670 






Ala 


Val 


Ser 


Asn 


Asn 


Glu 






685 








Gin 


Tvr 


Thr 


lur 


Ser 


Leu 




700 










Pro 


Asn 


Val 


As n 


Val 


Glv 


715 










720 


Leu 


Met 


He 


Phe 


His 


Phe 










735 




Met 


Lys 


Thr 


Asp 


Tyr 


Ala 








750 






Asn 


Asn 


Cys 


Trp 


Ala 


He 






765 








Phe 


Glu 


Asn 


Gly 


Arg 


Leu 




780 










Gin 


Leu 


Val 


Tyr 


Ala 


Tvr 


795 










800 


Gly Arg Arg 


Phe 


Ser 


Asn 










815 




Gly 


He 


Arg 


Phe 


Glu 


Lys 








830 






Phe 


Ser 


Phe 


Ser 


Tyr 


He 






845 








Glu 


Ala 


Ala 


Leu 


Val 


He 




860 










His 


Val 


Ser 


Arg 


His 


Ala 


875 










880 
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Phe Val Gly Ser Gly Thr Gly Arg Tyr His Phe Asn Asp Tyr Thr Glu 

885 890 895 

Leu Leu" Cys Arg Gly Ser lie Glu Cys Arg Pro His Ala Arg Asn Tyr 

900 905 910 

Asn lie Asn Cys Gly Ser Lys Phe Arg Phe 
915 920 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 52 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATGAAGATTC CACTCCGCTT TTTATTGATA TCATTAGTAC CTACGCTTTC TATGTCGAAT 60 

TTATTAGGAG CTGCTACTAC CGAAGAGCTA TCGGCTAGCA ATAGCTTCGA TGGAACTACA 12 0 

TCAACAACAA GCTTTTCTAG TAAAACAT C A TCGGCTACAG ATGGCACCAA TTATGTTTTT 180 

AAAGATTCTG TAGTTATAGA AAATGTACCC AAAACAGGGG AAACTCAGTC TACTAGTTGT 240 

TTTAAAAATG ACGCTGCAGC TGGAGATCTA AATTTCTTAG GAGGGGGATT TTCTTTCACA 3 00 

TTTAGCAATA TCGATGCAAC CACGGCTTCT GGAGCTGCTA TTGGAAGTGA AGCAGCTAAT 3 60 

AAGACAGTCA CGTTATCAGG ATTTTCGG C A CTTTCTTTTC TTAAATCCCC AG CAAGT ACA 42 0 

GTGACTAATG GATTGGGAGC TATCAATGTT AAAGGGAATT TAAG CCTATT GGATAATGAT 4 80 

AAGGTATTGA TTCAGGACAA TTTCTCAACA GGAGATGGCG GAGCAATTAA TTGTGCAGGC 54 0 

TCCTTGAAGA TCGCAAACAA TAAGTCCCTT TCTTTTATTG GAAATAGTTC TTCAACACGT 600 

GG CGGAGCG A TTCAT AC CAA AAACCTCACA CTAT CTTCTG GTGGGGAAAC T CTATTTC AG 660 

GGG AATACAG CGCCTACGGC TGCTGGTAAA GGAGGTGCTA TCGCGATTGC AG ACT CTGGC 72 0 

ACCCTATCCA TTTCTGGAGA CAGTGGCGAC ATTATCTTTG AAGG CAAT AC GATAGGAGCT 780 

ACAGGAACCG TCTCTCATAG TG CTATTG AT TTAGGAACTA G CGCTAAG AT AACTG CGTTA 84 0 

CGTGCTGCGC AAGGACATAC GATATACTTT TATGATCCGA TTACTGTAAC AGGATCGACA 900 

TCTGTTGCTG ATGCTCTCAA TATTAATAGC CCTGATACTG GAGATAACAA AGAGTATACG 960 

GGAACCATAG TCTTTTCTGG AGAGAAGCTC ACGGAGGCAG AAGCTAAAGA TGAGAAGAAC 102 0 

CGCACTTCTA AATTACTTCA AAATGTTG CT TTTAAAAATG GGACTGTAGT TTTAAAAGGT 1080 

GATGTCGTTT TAAGTGCGAA CGGTTTCTCT CAGGATGCAA ACTCTAAGTT GATTATGGAT 114 0 

TTAGGGACGT CGTTGGTTGC AAACACCGAA AGTATCGAGT TAACGAATTT GGAAATTAAT 12 00 

AT AG ACTCT C TCAGGAACGG GAAAAAGATA AAACTCAGTG CTGCCACAGC T C AG AAAG AT 12 60 

ATTCGTATAG ATCGTCCTGT TGTACTGGCA ATT AG CGATG AGAGTTTTTA TCAAAATGGC 1320 

TTTTTGAATG AGGACCATTC CTATGATGGG ATTCTTGAGT TAGATGCTGG GAAAGACATC 13 80 

GTGATTTCTG C AG ATT CT CG CAGTATAAAT GCTGTACAAT CTCCGTATGG CTATCAGGGA 144 0 ' 

AAGTGGACAA TCAATTGGTC TACTGATGAT AAGAAAGCTA CGGTTTCTTG GGCAAAGCAA 1500 

AGTTTTAATC CCACTGCTGA GCAGGAGGCT CCGTTAGTTC CTAATCTTCT TTGGGGTTCT 1560 

TTT AT AG AT G TTCGTCCCTT CCAAAATTTT ATAGAGCTAG GTACTGAAGG TGCTC CTTAC 1620 

GAAAAGAGAT TTTGGGTTGC AGGCATTTCC AATGTTTTGC ATAGGAG CGG TCGTGAAAAT 1680 

CAAAGGAAAT TCCGTCATGT GAGTGGAGGT GCTGTAGTAG GTGCTAGCAC GAGGATGCCG 174 0 

GGTGGTGATA CCTTGTCTCT GGGTTTTG CT CAG CTCTTTG CGCGTGACAA AGACTACTTT 1800 

ATGAATACCA ATTTCGCAAA GACCTACGCA GGATCTTTAC GTTTGCAGCA CGATG CTTCC 18 60 

CTATACTCTG TGGTGAGTAT CCTTTTAGGA GAGGGAGGAC TCCGCGAGAT CCTGTTGCCT 192 0 

TATGTTTCCA AGACTCTGCC GTGCTCTTTC TATGGGCAGC T TAG CT ACGG C C AT ACGG AT 1980 

CATCGCATGA AG AC CG AGTC TCTACCCCCC CCCCCCCCGA CGCTCTCGAC GG AT C ATACT 2 04 0 

TCTTGGGGAG GATATGTCTG GG CTG GAG AG CTGGGAACTC GAGTTGCTGT TGAAAATACC 2100 

AG CGG C AGAG GATTTTTCCG AGAGTACACT CCATTTGTAA AAGTCCAAGC TGTTTACTCG 216 0 

CGCCAAGATA GCTTTGTTGA ACTAGGAGCT ATCAGTCGTG ATTTTAGTGA TTCGCATCTT 222 0 

TATAACCTTG CGATTCCTCT TGGAATCAAG TTAGAGAAAC GGTTTGCAGA GCAATATTAT 2 2 80 
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CATGTTGTAG CGATGTATTC TCCAGATGTT TGTCGTAGTA ACCCCAAATG T ACG ACT AC C 2 340 

CTACTTTCCA ACCAAGGGAG TTGGAAGACC AAAGGTT CG A ACTTAGCAAG AC AGG CTGGT 24 00 

ATTGTT CAGG CCTCAGGTTT TCGATCTTTG GGAGCTGCAG CAGAGCTTTT CGGGAACTTT 2 4 60 

GGCTTTGAAT GGCGGGGATC TTCTCGTAGC TATAATGTAG ATGCGGGTAG CAAAATCAAA 2 520 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



Met 


Lys 


He 


Pro 


Le xi 


Arg 


fne 


Leu 


Leu 


I le 


Ser 


Leu 


Val 


Pro 


Thr 


Leu 


1 








5 










10 










15 




Ser 


Met 


C p» -r- 


noli 

2 0 


Leu 


Leu 




Ala 


Ala 
25 


Thr 


Thr 


Glu 


Glu 


Leu 
30 


Ser 


Ala 


Ser 


Asn 


Ser 


Phe 


As p 


G ly 




Thr 


Ser 


l nr 


Thr 


Ser 


Phe 


Ser 


Ser 


Lys 






35 










40 










45 






Thr 


Ser 


Ser 


Ala 


Thr 


Asp 




Thr 


Asn 


Tyr 


Val 


Phe 


Lys 


Asp 


Ser 


Val 




50 










55 










b 0 






Val 


He 


Glu 


Asn 


Val 


Pro 


Lys 


Thr 


Gly 


<j±U 


Thr 


Gin 


Ser 


Thr 


Ser 


Cys 


65 










70 










75 










80 


Phe 


Lvs 


Asn 


Asp 


Ala 


Ala 


Ala 


Gly Asp 


Leu 


Asn 


Phe 


Leu 


Gly 


Gly 


Gly 










85 










q n 

17 u 










95 


Phe 


Ser 


Phe 


Thr 


Phe 


Ser 


Asn 


He 


Asp 


Ala 


Thr 


Thr 


Ala 


Ser 


Gly 


Ala 








100 










105 










110 




Ala 


He 


Gly 


Ser 


Glu 


Ala 


Ala 


Asn 


Lys 


Thr 


Val 


Thr 


Leu 


Ser 


Gly 


Phe 






115 










12 0 










125 






Ser 


Ala 


Leu 


Ser 


Phe 


Leu 


Lys 


Ser 


Pro 


Ala 


Ser 


Thr 


Val 


Thr 


Asn 


Gly 




130 










135 










140 








Leu 


Gly 


Ala 


He 


Asn 


Val 


Lys 


Gly 


Asn 


Leu 


Ser 


Leu 


Leu 


Asp 


Asn 


Asp 


145 










150 










155 








160 


Lys 


Val 


Leu 


He 


Gin 


Asp 


Asn 


Phe 


Ser 


Thr 


Gly 


Asp 


Gly Gly Ala 


He 










165 










170 










175 




Asn 


Cys 


Ala 


Gly 
180 


Ser 


Leu 


Lys 


He 


Ala 
185 


Asn 


Asn 


Lys 


Ser 


Leu 
190 


Ser 


Phe 


He 


Gly 


Asn 


Ser 


Ser 


Ser 


Thr 


Arg 


Gly 


Gly 


Ala 


He 


His 


Thr 


Lys 


Asn 






195 










200 










205 






Leu 


Thr 


Leu 


Ser 


Ser 


Gly 


Gly 


Glu 


Thr 


Leu 


Phe 


Gin 


Gly Asn 


Thr 


Ala 




210 










215 










220 










Pro 


Thr 


Ala 


Ala 


Gly 


Lys 


Gly 


Gly 


Ala 


He 


Ala 


He 


Ala 


Asp 


Ser 


Gly 


225 










230 










235 








240 


Thr 


Leu 


Ser 


He 


Ser 
245 


Gly 


Asp 


Ser 


Gly 


Asp 
250 


He 


He 


Phe 


Glu 


Gly 
255 


Asn 


Thr 


He 


Gly Ala 


Thr 


Gly 


Thr 


Val 


Ser 


His 


Ser 


Ala 


He 


Asp 


Leu 


Gly 








260 










265 










270 




Thr 


Ser 


Ala 
275 


Lys 


He 


Thr 


Ala 


Leu 
280 


Arg 


Ala 


Ala 


Gin 


Gly 
285 


His 


Thr 


He 


Tyr 


Phe 


Tyr 


Asp 


Pro 


He 


Thr 


Val 


Thr 


Gly 


Ser 


Thr 


Ser 


Val 


Ala 


Asp 




290 










295 










300 








Ala 


Leu 


Asn 


He 


Asn 


Ser 


Pro 


Asp 


Thr 


Gly Asp 


Asn 


Lys 


Glu 


Tyr 


Thr 
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305 










310 










jlD 










32 0 


Glv 


Thr 


He 


Val 


php 

tr lie 


O CI 




Glu 


Lys 


Leu 


Thr 


Glu 


Ala 


Glu 


Ala 


Lys 










32 5 










j jU 










335 


Asp 


Glu 


Lys 


Asn 




Thr 




Lys 


Leu 


Leu 


Ljin 


Asn 


vai 


Ala 


Phe 


Lys 








340 










345 










350 




Asn 


Gly Thr 


Val 


Val 


Leu 


Lys 


Gly Asp 


vdl 


v a i 


Leu 


ser 


Ala 


Asn 


Gly 






-> -j j 










i r n 
J fa 0 










~i c c 

Job 






Phe 


DCl 


uin 


Asp 


r\±. CI 


Asn 




Lys 


Leu 


Tl Q 

lie 


Met 


Asp 


Leu 


Gly Thr 


Ser 




j / \j 




















*a a a 
j o 0 










Leu 


VcLl 


Aid 


Asn 


i ill. 


ulU 


Ser 


lie 


Glu 


Leu 


Thr 


Asn 


Leu 


Glu 


He 


Asn 


385 




















"3 Q C 

3 9b 










400 


lie 


Asp 


ber 


lieu 


Arg 


Asn 


uiy 


Lys 


Lys 


lie 


Lys 


Leu 


Ser 


Ala 


Ala 


Thr 










405 










H 1 U 










415 




Ala 


Gin 


Lys 


As p 




Arg 


Tic* 

ne 


Asp 


Arg 


Pro 


vai 


Val 


Leu 


Ala 


He 


Ser 








420 










425 










430 






Asp 


Glu 


Ser 


pv. (=> 

tr 1 lc 


lyr 


m n 
oin 


Asn 


Gly 


Phe 


Leu 


Asn 


Glu 


Asp 


His 


Ser 


Tyr 






4 J b 










440 










4 4 5 






Asp 


Gly 


lie 


Leu 


uiU 


Leu 


Asp 


Ala 


Gly 


Lys 


Asp 


He 


Val 


He 


Ser 


Ala 




4 b U 










4 ^ 

HDD 










4 60 










Asp 


Ser 


Arg 


Ser 


Tl « 

lie 


Asn 


Ala 


Val 


Gin 


Ser 


Pro 


Tyr 


Gly 


Tyr 


Gin 


Gly 


465 










4. 7 n 










4 / b 










480 


Lys 


Trp 


Thr 


lie 


Asn 


Trp 


ber 


Thr 


Asp 


Asp 


Lys 


Lys 


Ala 


Thr 


Val 


Ser 










4 ft ^ 

HOD 










490 










495 




Tm 


Ala 


Lys 




otr r 


rflc 


Asn 


Pro 


Thr 


Ala 


Glu 


Gin 


Glu 


Ala 


Pro 


Leu 








J u u 










505 










510 






v a x 


Pro 


Asn 


Leu 


Leu 


Trp 


Cjly 


Ser 


Phe 


He 


Asp 


Val 


Arg 


Pro 


Phe 


Gin 






515 










520 










525 








Asn 


Phe 


He 


UlU 


Leu 


L»iy 


Thr 


Glu 


Gly Ala 


Pro 


Tyr 


Glu 


Lys 


Arg 


Phe 




3-JU 










b J b 










540 








irp 


val 


Ala 


biy 


xi- 


O A -V- 


Asn 


Val 


Leu 


His 


Arg 


Ser 


Gly 


Arg 


Glu 


Asn 


54 5 










^ ^ n 










555 










560 




Arg 


Lys 


rile 


Arg 


HIS 


val 


Ser 


Gly 


Gly 


Ala 


Val 


Val 


Gly 


Ala 


Ser 










D O _> 










570 










575 




Thr 


Arg 


Met 


ST X tj 


uiy 


Gly 


Asp 


Thr 


Leu 


Ser 


Leu 


Gly 


Phe 


Ala 


Gin 


Leu 








580 










585 










590 






Phe 


■ft "1 

Ala 


Arg 


Asp 


Lys 


Asp 


Tyr 


Phe 


Met 


Asn 


Thr 


Asn 


Phe 


Ala 


Lys 


Thr 






C Q C 

b 9b 










600 










605 






Tvr 


Ala 


Gly 




Leu 


Arg 


Leu 


Gin 


His 


Asp 


Ala 


Ser 


Leu 


Tyr 


Ser 


Val 




O 1 u 










615 










620 










V CLX 


Cay 


lie 


Leu 


Leu 


Gly Glu Gly Gly Leu Arg Glu 


He 


Leu 


Leu 


Pro 


625 










630 










635 










640 


T\rr 
i y i- 


vai 


Ser 


Lys 


Thr 


Leu 


Pro 


Cys 


Ser 


Phe 


Tyr 


Gly Gin 


Leu 


Ser 


Tyr 










645 










650 










655 


Glv 
ox y 


His 


Thr 


Asp 


His 


Arg 


Met 


Lys 


Thr 


Glu 


Ser 


Leu 


Pro 


Pro 


Pro 


Pro 








660 










665 










670 






Pro 


Thr 


Leu 


Ser 


Thr 


Asp 


His 


Thr 


Ser 


Trp 


Gly Gly Tyr 


Val 


Trp 


Ala 






675 










680 










685 








Glv 


Glu 


Leu 


Gly Thr 


Arg 


Val 


Ala 


Val 


Glu 


Asn 


Thr 


Ser 


Gly 


Arg 


Gly 




690 










695 










700 




Phe 


Phe 


Arg 


Glu 


Tyr 


Thr 


Pro 


Phe 


Val 


Lys 


Val 


Gin 


Ala 


Val 


Tyr 


Ser 


705 










710 










715 








720 


Arg 


Gin 


Asp 


Ser 


Phe 


Val 


Glu 


Leu 


Gly Ala 


He 


Ser 


Arg 


Asp 


Phe 


Ser 










725 










730 










735 




Asp 


Ser 


His 


Leu 


Tyr 


Asn 


Leu 


Ala 


lie 


Pro 


Leu 


Gly 


He 


Lys 


Leu 


Glu 








740 










745 










750 






Lys 


Arg 


Phe 


Ala 


Glu 


Gin 


Tyr 


Tyr 


His 


Val 


Val 


Ala 


Met 


Tyr 


Ser 


Pro 



755 760 765 
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52 



Asp 


Val 
770 


Cys 


Arg 


Ser 


Asn 


Pro 
775 


Lys 


Cys 


Thr 


Thr Thr 
780 


Leu 


Leu 


Ser 


Asn 


Gin 


Gly 


Ser 


Trp 


Lys 


Thr 


Lys 


Gly 


Ser 


Asn 


Leu Ala 




m n 




oJLy 


785 










790 










795 






800 


He 


Val 


Gin 


Ala 


Ser 
805 


Gly 


Phe 


Arg 


Ser 


Leu 
810 


Gly Ala 


Ala 


Ala 


Glu 
815 


Leu 


Phe 


Gly 


Asn 


Phe 
820 


Gly 


Phe 


Glu 


Trp 


Arg 
825 


Gly 


Ser Ser 


Arg 


Ser 
830 


Tyr 


Asn 


Val 


Asp 


Ala 
835 


Gly 


Ser 


Lys 


He 


Lys 
840 


Phe 















(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



ATGAAGTCTT CTTTCCCCAA GTTTGTATTT TCTACATTTG CTATTTTCCC TTTGTCTATG 60 

ATTGCTACCG AGACAGTTTT GGATTCAAGT GCGAGTTTCG ATGGGAATAA AAATGGTAAT 120 

TTTTCAGTTC GTGAGAGTCA GGAAGATGCT. GGAACTACCT ACCTATTTAA GGGAAATGTC 180 

ACT CT AG AAA ATATTCCTGG AACAGGCACA GCAATCACAA AAAGCTGTTT TAACAACACT 240 

AAGGGCGATT TGACTTTCAC AGGTAACGGG AACTCTCTAT TGTTCCAAAC GGTGGATGCA 300 

GGGACTGTAG CAGGGGCTGC TGTTAACAGC AGCGTGGTAG ATAAATCTAC CACGTTTATA 360 

GGGTTTTCTT CGCTATCTTT TATTGCGTCT CCTGGAAGTT CGATAACTAC CGGCAAAGGA 420 

GCCGT TAGC T GCTCTACGGG TAGCTTGAAG TTTGACAAAA ATGTCAGTTT GCTCTTC AG C 480 

AAAAACTTTT CAACGGATAA TGGCGGTG CT ATCACCGCAA AAACTCTTTC ATTAACAGGG 54 0 

ACTACAATGT CAGCTCTGTT TTCTGAAAAT ACCTCCTCAA AGAAAGGCGG AGCCATTCAG 600 

ACTTCCGATG CCCTTACCAT TACTGGAAAC CAAGGGGAAG TCTCTTTTTC TGACAATACT 660 

TCTTCGGATT CTGGAGCTGC AATTTTTACA GAAGCCTCGG TGACTATTTC TAATAATGCT 72 0 

AAAGTTTCCT TTATTGACAA TAAGGTCACA GGAGCGAGCT CCTCAACAAC GGGGGATATG 780 

TCAGGAGGTG CTATCTGTGC TTATAAAACT AGTACAGATA CTAAGGTCAC CCTCACTGGA 84 0 

AATCAGATGT TACTCTTCAG CAACAATACA TCGACAACAG CGGGAGGAGC TATCTATGTG 900 

AAAAAGCTCG AACTGGCTTC CGGAGGACTT ACCCTATTCA GTAGAAATAG TGTGAATGGA 96 0 

GGTACAGCTC CTAAAGGTGG AGCCATAGCT ATCGAAGATA GTGGGGAATT GAGTTTATCC 1020 

GCCGATAGTG GTGACATTGT CTTTTTAGGG AATACAGTCA CTTCTACTAC TCCTGGGACG 1080 

AATAGAAGTA GTATCGACTT AGGAACGAGT GCAAAGATGA CAGCTTTGCG TTCTGCTGCT 114 0 

GGTAGAGCCA TCTACTTCTA TGATCCCATA ACTACAGGAT CTTCCACAAC AGTTACAGAT 1200 

GTCTTAAAAG TTAATGAGAC TCCGGCAGAT TCTGCACTAC AATATACAGG GAACATCATC 126 0 

TTCACAGGAG AAAAGTTATC AGAGACAGAG GCCGCAGATT CTAAAAATCT TACTTCGAAG 132 0 

CTACTAC AG C CTGTAACTCT TTCAGGAGGT ACTCTATCTT TAAAACATGG AGTGACTCTG 1380 

CAGACTCAGG CATTCACTCA ACAGGCAGAT TCTCGTCTCG AAATGGACGT AGGAACTACT 144 0 

CTAGAACCTG CTGATACTAG CACCATAAAC AATTTGGTCA TTAACATCAG TTCTATAGAC 1500 

GGTGCAAAGA AGGCAAAAAT AGAAACCAAA GCTACGTCAA AAAATCTGAC TTTATCTGGA 1560 

ACCATCACTT TATTGGAC CC GACGGGCACG TTTTATGAAA ATCATAGTTT AAGAAATC CT 1620 

CAGTCCTACG ACATCTTAGA GCTCAAAGCT TCTGGAACTG TAACAAGCAC CGCAGTGACT 1680 

CCAG AT CCT A TAATGGGTGA GAAATTCCAT TACGGCTATC AGGGAACTTG GGGCCCAATT 1740 

GTTTGGGGGA CAGGGGCTTC TACGACTGCA ACCTTCAACT GGACTAAAAC TGGCTATATT 1800 

CCTAATCCCG AGCGTATCGG CTCTTTAGTC CCTAATAGCT TATGGAATGC ATTTATAGAT 1860 

ATTAGCTCTC TCCATTATCT TATGGAGACT GCAAACGAAG GGTTGCAGGG AGACCGTGCT 1920 

TTTTGGTGTG CTGGATTATC TAACTTCTTC CATAAGGATA GTACAAAAAC ACGACGCGGG 1980 

TTTCGCCATT TGAGTGGCGG TTATGTCATA GGAGGAAACC TACATACTTG TTCAGATAAG 2 040 
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ATTCTTAGTG CTGCATTTTG TCAGCTCTTT GGAAGAGATA GAGACTACTT TGTAGCTAAG 2100 

AAT CAAGGT A CAGTCTACGG AGGAACTCTC TATTACCAGC ACAACGAAAC CTATATCTCT 2160 

CTTCCTTGCA AACTACGGCC TTGTT CGTTG TCTTATGTTC CTACAGAGAT TCCTGTTCTC 2 220 

TTTTCAGGAA ACCTTAGCTA CACCCATACG GATAACGATC TGAAAACCAA GTATACAACA 22 80 

TAT CCTACTG TTAAAGGAAG CTGGGGGAAT GATAGTTTCG CTTTAGAATT CGGTGGAAGA 2340 

GCTCCGATTT GCTTAGATGA AAGTGCTCTA TTTGAG CAGT ACATGCCCTT CATGAAATTG 2 4 00 

CAGTTTGTCT ATGCACATCA GGAAGGTTTT AAAGAACAGG GAACAGAAGC TCGTGAATTT 24 60 

GGAAGTAGCC GTCTTGTGAA TCTTGCCTTA CCTATCGGGA TCCGATTTGA TAAGGAATCA 2 520 

GACTGCCAAG ATGCAACGTA CAATCTAACT CTTGGTTATA CTGTGGATCT TGTT CGTAGT 2 580 

AACCCCGACT GTACGACAAC ACTGCGAATT AGCGGTGATT CTTGGAAAAC CTTCGGTACG 2 64 0 

AATTTGGCAA GACAAGCTTT AGTCCTTCGT GCAGGGAACC ATTTTTG CTT TAACTCAAAT 2 700 

TTTGAAGCCT TTAG CCAATT TTCTTTTGAA TTGCGTGGGT CATCTCGCAA TTACAATGTA 2 76 0 
GACTTAGGAG CAAAATACCA ATTCTAA 



2787 



(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met Lys Ser Ser Phe Pro Lys Phe Val Phe Ser Thr Phe Ala He Phe 

1 5 10 15 

Pro Leu Ser Met He Ala Thr Glu Thr Val Leu Asp Ser Ser Ala Ser 

20 25 30 

Phe Asp Gly Asn Lys Asn Gly Asn Phe Ser Val Arg Glu Ser Gin Glu 

35 40 45 

Asp Ala Gly Thr Thr Tyr Leu Phe Lys Gly Asn Val Thr Leu Glu Asn 

50 55 60 

He Pro Gly Thr" Gly Thr Ala He Thr Lys Ser Cys Phe Asn Asn Thr 
65 70 75 80 

Lys Gly Asp Leu Thr Phe Thr Gly Asn Gly Asn Ser Leu Leu Phe Gin 

85 90 95 

Thr Val Asp Ala Gly Thr Val Ala Gly Ala Ala Val Asn Ser Ser Val 

100 105 110 

Val Asp Lys Ser Thr Thr Phe He Gly Phe Ser Ser Leu Ser Phe He 

H5 120 125 

Ala Ser Pro Gly Ser Ser He Thr Thr Gly Lys Gly Ala Val Ser Cys 

130 135 140 

Ser Thr Gly Ser Leu Lys Phe Asp Lys Asn Val Ser Leu Leu Phe Ser 
145 150 155 160 

Lys Asn Phe Ser Thr Asp Asn Gly Gly Ala He Thr Ala Lys Thr Leu 

165 170 175 

Ser Leu Thr Gly Thr Thr Met Ser Ala Leu Phe Ser Glu Asn Thr Ser 

180 185 igo 

Ser Lys Lys Gly Gly Ala He Gin Thr Ser Asp Ala Leu Thr He Thr 

195 200 205 

Gly Asn Gin Gly Glu Val Ser Phe Ser Asp Asn Thr Ser Ser Asp Ser 

210 215 220 

Gly Ala Ala He Phe Thr Glu Ala Ser Val Thr He Ser Asn Asn Ala 
225 230 235 240 

Lys Val Ser Phe He Asp Asn Lys Val Thr Gly Ala Ser Ser Ser Thr 
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245 










Thr 


Gly 


Asp 


Met 


Ser 


Gly 


Gly 


Ala 


He 








260 










265 


Asp 


Thr 


Lys 


Val • 


Thr 


Leu 


Thr 


Gly 


Asn 






275 










280 




Asn 


Thr 


Ser 


Thr 


Thr 


Ala 


Gly 


Gly 


Ala 




290 










295 






Leu 


Ala 


Ser 


Gly 


Gly 


Leu 


Thr 


Leu 


Phe 


305 










310 








Gly Thr 


Ala 


Pro 


Lys 


Gly 


Gly 
i 


Ala 


He 










325 










Leu 


Ser 


Leu 


Ser 


Ala 


Asp 


Ser 


Glv 


Asp 








340 










345 


Val 


Thr 


Ser 


Thr 


Thr 


Pro 


Gly 


Thr 


Asn 






355 










360 




Thr 


Ser 


Ala 


Lys 


Met 


Thr 


Ala 


Leu 






370 










375 




Tyr 
i *■ 


Phe 


Tyr 


Asp 


Pro 


He 


Thr 


Thr 


Gly 


385 










390 






Val 


Leu 


Lys 


Val 


Asn 


Glu 


Thr 


Pro 












405 










Gly 


Asn 


He 


He 


Phe 


Thr 


Gly 


Glu 


Liys 








420 










425 


Asp 


Ser 


Lys 


Asn 


Leu 


Thr 


Ser 


Lys 


Leu 






435 










440 




Gly 


Gly 


Thr 


Leu 


Ser 


Leu 


Lvs 


His 


Glv 




450 










455 




Phe 


Thr 


Gin 


Gin 


Ala 


Asp 


Ser 


Arg 


Leu 


465 










470 








Leu 


Glu 


Pro 


Ala 


Asp 


Thr 


Ser 


Thr 


X xc 










485 










Ser 


Ser 


He 


Asp 


Gly 


Ala 


Lvs 


Lvs 


Ala 








500 










505 


Ser 


Lys 


Asn 


Leu 


Thr 


Leu 


Ser 


Glv 

*j x y 


Thr 






515 










520 




Gly Thr 


Phe 


Tyr 


Glu 


Asn 


His 


Ser 


Leu 




530 










535 






He 


Leu 


Glu 


Leu 


Lys 


Ala 


Ser 


Gly 


Thr 


545 










550 








Pro 


Asp 


Pro 


lie 


Met 


Gly 


Glu 


Lys 


Phe 










565 










Trp 


Gly 


Pro 


He 


Val 


Trp 


Gly 


Thr 


Gly 








580 










585 


Asn 


Trp 


Thr 


Lys 


Thr 


Gly 


Tvr 


He 


Pro 






595 










600 




Leu 


Val 


Pro 


Asn 


Ser 


Leu 


Trp 


Asn 


Ala 




610 










615 






His 


Tyr 


Leu 


Met 


Glu 


Thr 


Ala 


Asn 


Glu 


625 










630 








Phe 


Trp 


Cys 


Ala 


Gly 


Leu 


Ser 


Asn 


Phe 










645 










Thr 


Arg 


Arg 


Gly 


Phe 


Arg 


His 


Leu 


Ser 








660 










665 


Asn 


Leu 


His 


Thr 


Cys 


Ser 


Asp 


Lys 


He 






675 










680 




Leu 


Phe 


Gly 


Arg 


Asp 


Arg 


Asp 


Tyr 


Phe 



690 695 
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54 



250 










255 




Cys 


Ala 


Tyr 


Lys 


Thr 


Ser 


Thr 










270 






Gin 


Met 


Leu 


Leu 


Phe 


Ser 


Asn 








285 








He 


Tyr 


Val 


Lvs 


Lys 


Leu 


Glu 






300 










Ser 


Arg Asn 


Ser 


Val 


Asn 


wx y 




J X _} 










320 


Ala 


Tic* 
x x e 




Asp 


Ser 


Gly 


ox u 


330 










335 




He 


Vdl 


T-\ V> _v 


Leu 


Gly Asn 


Thr 










350 






Arcr 


oer 




He 


Asp 


Leu 


Ox y 








365 








Ser 


Ala 


Ala 


Gly 


Arg 


Ala 


lie 






"ion 
joU 










Ser 


O n. -w- 


Thr 


Thr 


Val 


Thr 


Asp 














Ann 


Asp 


Ser 


Ala 




Gin 


Tyr 


i nr 


410 










415 




Leu 


Ser 


Glu 


Thr 


Glu 


Ala 


tin 

m.i a 










430 






Leu 


r*l i-i 
L»xn 


Pro 


Val 


Thr 


Leu 










445 








Val 


inr 


Leu 


Gin 


Thr 


Gin 


/\ia 
















Glu 


Met 


Asp 


Val 


Gly Thr 


inr 




4 *7 <^ 










4 80 


Asn 


Asn 


Leu 


Vdl 


He 


Asn 


lie 


490 










495 




Lys 


lie 


CjIU 


Thr 


Lys 


Ala 


J. 11X 










510 






He 


1 ILL 


Leu 


Leu 


Asp 


Pro 


l nr 








525 








Arcr 


Asn 


Pro 


Gin 


Ser 


Tyr 


Asp 






540 










Val 


Thr 


Ser 


Thr 


Ala 


Val 


x nr 




555 










S n 

J o u 


His 


Tyr 


Gly 


Tvr 
x y x 


Gin 


Gly 


i nr 


570 










575 




Ala 


Ser 


Thr 


Thr 


Ala 


Thr 


IT HC 










590 






Asn 


Pro 


Glu 


Arar 


He 


Gly 


Q O T- 

O C X 








605 








Phe 


He 


Asp 


He 


Ser 


Ser 


lieu 






620 










Gly 


Leu 


Gin 


Glv 

vjx y 


Asp 


Arg 


m.x a 




635 










640 


Phe 


His 


Lys 


Asp 


Ser 


Thr 


Lys 


650 










655 




Gly 


Gly Tyr 


Val 


He 


Gly 


Gly 










670 






Leu 


Ser 


Ala 


Ala 


Phe 


Cys 


Gin 








685 








Val 


Ala 


Lys 


Asn 


Gin 


Gly 


Thr 



700 
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Val 


Tvr 

-* 


Glv 


Glv 


Thr 


Leu 


Tyr 


Tyr 


(jin 


705 










710 








Leu 


Pro 


Cvs 


Lys 


Leu 
725 


Arg 


Pro 


Cys 




lie 


Pro 


Val 


Leu 


Phe 


Ser 


Gly Asn 


Leu 








740 










745 


Asp 


Leu 


Lvs 


Thr 


Lys 


xyr 


Thr 


Thr 


Tyr 






755 










760 


Gly 


Asn 
770 


Asd 


Ser 


Phe 


Ala 


Leu 
775 


Glu 


rile 


Leu 


Asp 


Glu 


Ser 


Ala 


Leu 


Phe 


Glu 


bill 


785 










790 








Gin 


Phe 


Val 


j. yi. 


Ala 
805 


His 


Gin 


Glu 


Gly 


Ala 




Glu 


Phe 


Gly Ser 


Ser 


Arg 


Leu 








820 










825 


Glv 


lie 


^ L y 


tr 11C 


Asp 


Lys 


Glu 


Ser 


Asp 






835 










840 


Leu 


Thr 
850 


Leu 


Gly 


Tyr 


Thr 


Val 
855 


Asp 


Leu 


Thr 


Thr 


Thr 


Leu 


Arg 


He 


Ser 


Gly Asp 


865 










870 








Asn 


Leu 


Ala 


Arg 


Gin 
885 


Ala 


Leu 


Val 


Leu 


Phe 


Asn 


Ser 


Asn 
900 


Phe 


Glu 


Ala 


Phe 


Ser 
905 


Gly 


Ser 


Ser 
915 


Arg 


Asn 


Tyr 


Asn 


Val 
920 


Asp 



(2) INFORMATION FOR SEQ ID 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 757 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 




PCT/DK98/00266 



55 



His 


Asn 


Glu 


Thr 


Tyr 


lie 


Ser 




715 










720 


Leu 


Ser 


Tyr 


Val 


Pro 


Thr 


Glu 


/3 0 










735 




O a 


Tyr 


Thr 


His 


Thr 


Asp 


Asn 










750 






Pro 


Thr 


Val 


Lys 


Gly 


Ser 


Trp 








765 








Gly 


Gly Arg Ala 


Pro 


lie 


Cys 






780 










Tyr 


Met 


Pro 


Phe 


Met 


Lys 


Leu 




795 










800 


Phe 


Lys 


Glu 


Gin 


Gly 


Thr 


Glu 


810 










815 




Val 


Asn 


Leu 


Ala 


Leu 


Pro 


He 










830 






Cys 


Gin 


Asp 


Ala 


Thr 


Tyr 


Asn 








845 








Val 


Arg 


Ser 


Asn 


Pro 


Asp 


Cys 






860 










Ser 


Trp 


Lys 


Thr 


Phe 


Gly 


Thr 




875 










880 


Arg 


Ala 


Gly 


Asn 


His 


Phe 


Cys 


890 










895 




Gin 


Phe 


Ser 


Phe 


Glu 


Leu 


Arg 










910 






Leu 


Gly Ala 


Lys 


Tyr 


Gin 


Phe 



925 



NO: 11: 



• (ii) MOLECULE TYPE: Genomit 

(xi) SEQUENCE DESCRIPTION: 

ATGAGATCGT CTTTTTCCTT GTTATTAATA 
AGTGTTTCTG CAGATGCTGC CGATCTCACA 
ACAAGCACCA CAGAATTTAC TCCTAAAGCG 
ATTCTCGATG GGGATGTCTC GATAAGCCAA 
TGTTTTTCTA ACACTGCAGG AAATCTTACC 
GACAATATTA TTTCGTCTAC TGTTG CAGGT 
ATTACGAAAT TCTCAGGATT TTCAACTCTT 
AAAGGAGCCA TTAAAATTAC CGATGGTCTG 
AATGAAAATG CCTCTAGTGA AAATGGGGGA 
GGGAGTACGC GGTTTGTAGC GTTCCTTGGC 
TATGCTTCTG GTG ACT CTGT GATTTCTGAG 
AGTGCGACAA CATCAGGAGG CGCGATCTCT 
CAAAATATCT TTTTCGATGG CTGCAAAGCA 
AAAGCAGGGG CGAACCCAGA CCCTATCTTG 
CTGAATAACA CAGCAGGAAA TAGTGGAGGT 
TCAGGACGAG GAGGAGTGTT ATTTTCTAAC 



: DNA 

SEQ ID NO: 11: 

TCTTCATCTC TAGCCTTTCC TCTCTTAATG 60 

TTAGGGAGTC GTGACAGTTA TAATGGTGAT 120 

GCAACTTCTG ATGCTAGTGG CACGACCTAT 180 

GCAGGGAAAC AAACG AG CTT AACCACAAGT 240 

TTCTTAGGGA ACGGATTTTC TCTTCATTTT 300 

GTTGTTGTTA GCAATACAGC AGCTT CTGGG 3 60 

CGGATGCTTG CAGCTCCTAG GACCACAGGT 420 

GTGTTTGAGA GTATAGGGAA TCTTGACCAA 4 80 

GCCATCAATA CGAAGACTTT GTCTTTGACT 540 

AATAGCTCGT CGCAACAAGG GGGAGCGATC 600 

AATGCAGGAA TCTTGAGCTT CGGAAACAAC 660 

GCTGAAGGGA ACCTTGTGAT CTCCAATAAC 720 

ACTACAAATG G CGG AGCTAT TGATTGTAAC 780 

ACT CTTT CAG GAAATGAGAG CCTGCATTTT 84 0 

GCGATTTATA CCAAAAAATT GGTGTT AT C C 900 

AACAAAGCTG CGAATGCTAC TCCTAAAGGA Q f.^ 
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GGGGCAATTG CGATTCTAGA TTCTGGAGAG ATTAGCATTT CTGCAGATCT CGG CAATATC 1020 

ATTTTCGAGG G CAAT ACT AC GAGCACTACA GGAAGTCCTG CGAGTGTGAC CAGAAATGCT 10 80 

ATAGATCTTG CATCGAATGC AAAATTTTTA AATCTCCGAG CGACTCGGGG AAATAAAGTT 1140 

ATTTTCTATG ATCCTATCAC GAGCTCAGGA GCTACTGATA AGCTCTCTTT GAATAAAGCT 1200 

GACGCAGGAT CTGGAAATAC CTATGAAGGC TACATCGTTT TCTCTGGAGA GAAACTCTCA 12 60 

GAAGAGGAAC TTAAGAAACC TGACAATCTG AAGT CTACAT TTACACAGGC TGT AG AG CTT 1320 

GCTGCAGGTG CCTTAGTATT GAAAGATGGA GTGACTGTAG TTGCAAATAC TATAACGCAG 13 80 

GTCGAGGGAT CGAAAGTCGT TATGGATGGA GGGACTACTT TTGAGGCAAG CGCTGAGGGG 144 0 

GTCACTCTCA ATGGCCTAGC CATTAATATA GATTCCTTAG ATGGGACAAA TAAAG CTATC 1500 

ATTAAGGCGA CGGCAGCAAG TAAGGATGTT GCCTTATCAG GGCCTATCAT G CTTGT AG AT 1560 

GCT CAGGGG A ACTATTATGA GCATCATAAT CTCAGTCAAC AGCAGGTCTT TCCTTTAATA 1620 

GAGCTTTCTG CACAAGGAAC GATGACTACT ACAGATATCC CCGATACCCC AATTCTAAAT 1680 

ACT ACG AAT C ACT ATGG GT A TCAAGGAACT GGAATAATTG TTTGGGTCGA CGATGCAACT 174 0 

GCAAAAACAA AAAATG CT AC CTTAACTTGG ACTAAAACAG G ATACAAGC C GAATC CAGAA 1800 

CGT CAGGGAC CTTTGGTTCC TAATAGCCTG TGGGGTTCTT TTGTCGATGT CCGCTCCATT 1860 

CAGAGC CTCA TGGACCGGAG CACAAGTTCG TTATCTTCGT CAACAAATTT GTGGGTATCA 1920 

GGAATCGCGG ACTTTTTGCA TGAAGATCAG AAAGGAAACC AACGTAGTTA T CGT C ATT CT 1980 

AGCGCGGGTT ATGCATTAGG AGGAGGATTC TTCACGGCTT CTGAAAATTT CTTTAATTTT 2 04 0 

GCTTTTTGTC AG CTTTTTGG CTACGACAAG GACCATCTTG TGGCTAAGAA CCATACCCAT 210 0 

GTATATGCAG GGGCAATGAG TT AC CG AC AC CTCGGAGAGT CTAAGAC CCT CGCTAAGATT 2160 

TTGTCAGGAA ATTCTGACTC CCTACCTTTT GTCTTCAATG CTCGGTTTGC TTATGGCCAT 2220 

ACCGACAATA ACATGACCAC AAAGTACACT GGCTATTCTC CTGTTAAGGG AAGCTGGGGA 2 2 80 

AATGATGCCT T C GGT AT AG A ATGTGGAGGA G CTATC CCGG TAGTTGCTTC AGGACGTCGG 234 0 

TCTTGGGTGG ATACCCACAC GCCATTTCTA AACCTAGAGA TGATCTATGC ACATCAGAAT 2400 

GACTTTAAGG AAAACGGCAC AGAAGGCCGT TCTTTCCAAA GTGAAGACCT CTTCAATCTA 2460 

GCGGTTCCTG TAGGGATAAA ATTTGAGAAA TTCTCCGATA AGTCTACGTA TGATCTCTCC 2 52 0 

AT AG CTT ACG TTCCCGATGT GATTCGTAAT GAT CCAGG CT GCACGACAAC TCTTATGGTT 2 580 

TCTGGGGATT CTTGGTCGAC ATGTGGTACA AGCTTGTCTA GACAAGCTCT TCTTGTACGT 2 64 0 

GCTGGAAATC ATCATGCCTT TGCTTCAAAC TTTGAAGTTT TCAGTCAGTT TGAAGTCGAG 2 700 

TTGCGAGGTT CTTCTCGTAG CTATGCTATC GATCTTGGAG GAAGATTCGG ATTTTAA 2 757 

(2) INFORMATION FOR SEQ ID NO; 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 
i (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Met Arg Ser Ser Phe Ser Leu Leu Leu lie Ser Ser Ser Leu Ala Phe 

15 10 15 

Pro Leu Leu Met Ser Val Ser Ala Asp Ala Ala Asp Leu Thr Leu Gly 

20 25 30 

Ser Arg Asp Ser Tyr Asn Gly Asp Thr Ser Thr Thr Glu Phe Thr Pro 

35 40 45 

Lys Ala Ala Thr Ser Asp Ala Ser Gly Thr Thr Tyr He Leu Asp Gly 

50 55 60 

Asp Val Ser He Ser Gin Ala Gly Lys Gin Thr Ser Leu Thr Thr Ser 
65 70 75 80 

Cys Phe Ser Asn Thr Ala Gly Asn Leu Thr Phe Leu Gly Asn Gly Phe 

85 90 95 

Ser Leu His Phe Asp Asn He He Ser Ser Thr Val Ala Gly Val Val 
100 105 110 
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Val Ser Asn Thr Ala Ala Ser Gly lie Thr Lys Phe Ser Gly Phe Ser 

115 120 125 

Thr Leu Arg Met Leu Ala Ala Pro Arg Thr Thr Gly Lys Gly Ala lie 

130 135 140 

Lys He Thr Asp Gly Leu Val Phe Glu Ser He Gly Asn Leu Aso Gin 
145 150 155 ~ 160 

Asn Glu Asn Ala Ser Ser Glu Asn Gly Gly Ala He Asn Thr Lys Thr 

165 170 175 

Leu Ser Leu Thr Gly Ser Thr Arg Phe Val Ala Phe Leu Gly Asn Ser 

180 185 190 

Ser Ser Gin Gin Gly Gly Ala He Tyr Ala Ser Gly Asp Ser Val He 

195 200 205 

Ser Glu Asn Ala Gly He Leu Ser Phe Gly Asn Asn Ser Ala Thr Thr 

210 215 220 

Ser Gly Gly Ala He Ser Ala Glu Gly Asn Leu Val He Ser Asn Asn 
225 230 235 240 

Gin Asn He Phe Phe Asp Gly Cys Lys Ala Thr Thr Asn Gly Gly Ala 

245 250 255 

He Asp Cys Asn Lys Ala Gly Ala Asn Pro Asp Pro He Leu Thr Leu 

260 265 270 

Ser Gly Asn Glu Ser Leu His Phe Leu Asn Asn Thr Ala Gly Asn Ser 

275 280 285 

Gly Gly Ala He Tyr Thr Lys Lys Leu Val Leu Ser Ser Gly Arg Gly 

290 295 300 

Gly Val Leu Phe Ser Asn Asn Lys Ala Ala Asn Ala Thr Pro Lys Gly 
305 310 315 320 

Gly Ala He Ala He Leu Asp Ser Gly Glu He Ser He Ser Ala Asp 

325 330 335 

Leu Gly Asn He He Phe Glu Gly Asn Thr Thr Ser Thr Thr Gly Ser 

340 345 350 

Pro Ala Ser Val Thr Arg Asn Ala He Asp Leu Ala Ser Asn Ala Lys 

355 360 365 

Phe Leu Asn Leu Arg Ala Thr Arg Gly Asn Lys Val He Phe Tyr Asp 

370 375 380 

Pro He Thr Ser Ser Gly Ala Thr Asp Lys* Leu Ser Leu Asn Lys Ala 
385 390 395 400 

Asp Ala Gly Ser Gly Asn Thr Tyr Glu Gly Tyr He Val Phe Ser Gly 

405 410 415 

Glu Lys Leu Ser Glu Glu Glu Leu Lys Lys Pro Asp Asn Leu Lys Ser 

420 425 430 

Thr Phe Thr Gin Ala Val Glu Leu Ala Ala Gly Ala Leu Val Leu Lys 

435 440 445 

Asp Gly Val Thr Val Val Ala Asn Thr He Thr Gin Val Glu Gly Ser 

450 455 460 

Lys Val Val Met Asp Gly Gly Thr Thr Phe Glu Ala Ser Ala Glu Gly 
465 470 475 480 

Val Thr Leu Asn Gly Leu Ala He Asn He Asp Ser Leu Asp Gly Thr 

485 490 495 

Asn Lys Ala He He Lys Ala Thr Ala Ala Ser Lys Asp Val Ala Leu 

500 505 510 

Ser Gly Pro He Met Leu Val Asp Ala Gin Gly Asn Tyr Tyr Glu His 

515 520 525 

His Asn Leu Ser Gin Gin Gin Val Phe Pro Leu He Glu Leu Ser Ala 

530 535 540 

Gin Gly Thr Met Thr Thr Thr Asp He Pro Asp Thr Pro He Leu Asn 
545 550 555 560 

Thr Thr Asn His Tyr Gly Tyr Gin Gly Thr Gly He He Val Trp Val 
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565 570 575 

Asp Asp Ala Thr Ala Lys Thr Lys Asn Ala Thr Leu Thr Trp Thr Lvs 

580 585 590 

Thr Gly Tyr Lys Pro Asn Pro Glu Arg Gin Gly Pro Leu Val Pro Asn 

595 600 60S 

Ser Leu Trp Gly Ser Phe Val Asp Val Arg Ser lie Gin Ser Leu Met 

610 615 620 

Asp Arg Ser Thr Ser Ser Leu Ser Ser Ser Thr Asn Leu Trp Val Ser 
625 "0 635 640 

Gly lie Ala Asp Phe Leu His Glu Asp Gin Lys Gly Asn Gin Arg Ser 

645 650 655 

Tyr Arg His Ser Ser Ala Gly Tyr Ala Leu Gly Gly Gly Phe Phe Thr 

660 665 670 

Ala Ser Glu Asn Phe Phe Asn Phe Ala Phe Cys Gin Leu Phe Gly Tyr 

675 680 685 

Asp Lys Asp His Leu Val Ala Lys Asn His Thr His Val Tyr Ala Glv 

690 695 700 

Ala Met Ser Tyr Arg His Leu Gly Glu Ser Lys Thr Leu Ala Lys lie 
705 71 ° 7X5 720 

Leu Ser Gly Asn Ser Asp Ser Leu Pro Phe Val Phe Asn Ala Arg Phe 

725 730 735 

Ala Tyr Gly His Thr Asp Asn Asn Met Thr Thr Lys Tyr Thr Gly Tyr 

740 745 750 

Ser Pro Val Lys Gly Ser Trp Gly Asn Asp Ala Phe Gly He Glu Cvs 

755 760 765 

Gly Gly Ala He Pro Val Val Ala Ser Gly Arg Arg Ser Trp Val Asd 

770 775 780 

Thr His Thr Pro Phe Leu Asn Leu Glu Met He Tyr Ala His Gin Asn 
785 790 795 aoo 

Asp Phe Lys Glu Asn Gly Thr Glu Gly Arg Ser Phe Gin Ser Glu Asp 

805 810 81S 

Leu Phe Asn Leu Ala Val Pro Val Gly He Lys Phe Glu Lys Phe Ser 

820 325 830 

Asp Lys Ser Thr Tyr Asp Leu Ser He Ala Tyr Val Pro Asp Val He 

835 840 8 4 5 

Arg Asn Asp Pro Gly Cys Thr Thr Thr Leu Met Val Ser Gly Asd Ser 

850 855 860 

Trp Ser Thr Cys Gly Thr Ser Leu Ser Arg Gin Ala Leu Leu Val Arg 
865 870 875 880 

Ala Gly Asn His His Ala Phe Ala Ser Asn Phe Glu Val Phe Ser Gin 

885 890 895 

Phe Glu Val Glu Leu Arg Gly Ser Ser Arg Ser Tyr Ala He Asp Leu 

900 90S 910 

Gly Gly Arg Phe Gly Phe 
915 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
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ATGAAATCCT CTCTTCATTG GTTTGTAATC TCGTCATCTT TAGCACTTCC CTTGTCACTA 6 0 

AATTTCTCTG CGTTTGCTGC TGTTGTTGAA ATCAAT CTAG GACCTACCAA TAGCTTCTCT 12 0 

GGACCAGGAA CCTACACTCC TCCAGCCCAA ACAACAAATG CAGATGGAAC TATCTATAAT 180 

CTAACAGGGG ATGT CTCAAT CACCAATGCA GGATCTCCGA CAGCTCTAAC CGCTTCCTGC 24 0 

TTTAAAGAAA CTACTGGGAA TCTTTCTTTC CAAGGC CACG GCTACCAATT TCTCCTACAA 300 

AATATCGATG CGGGAGCGAA CTGTACCTTT ACCAATACAG CTGCAAATAA GCTTCTCTCC 3 60 

TTTT CAGG AT TCTCCTATTT GTCACTAATA C AAAC CACG A ATGCTACCAC AG GAACAGG A 420 

GCCATCAAGT CCACAGGAGC TTGTTCTATT CAGTCGAACT ATAGTTG CTA CTTTGGCCAA 4 80 

AACTTTTCTA ATGACAATGG AGGCGCCCTC CAAGGCAGCT CTAT CAGTCT AT CGCTAAAC 54 0 

CCCAACCTAA CGTTTGCCAA AAACAAAG C A ACGCAAAAAG GGGGTGCCCT CTATTCCACG 600 

GGAGGGATTA CAATTAACAA TACGTTAAAC TCAGCATCAT TTTCTGAAAA TACCGCGGCG 660 

AACAATGGCG GAGCCATTTA CACGGAAGCT AGCAGTTTTA TT AG C AG CAA CAAAGCAATT 720 

AG CTTT AT AA ACAATAGTGT GACCGCAACC T C AG CT AC AG GGGGAGCCAT TTACTGTAGT 780 

AGTACATCAG CCCCCAAACC AGTCTTAACT CTAT C AG AC A ACGGGGAACT GAACTTTATA 84 0 

GGAAATACAG CAATTACTAG TGGTGGGGCG ATTTATACTG ACAATCTAGT TCTTTCTTCT 900 

GGAGGACCTA CGCTTTTTAA AAACAACT CT GCTATAGATA CTGCAGCTCC CTTAGGAGGA 960 

GCAATTGCGA TTGCTGACTC TGGATCTTTG AGTCTTTCGG CTCTTGGTGG AGACATCACT 1020 

TTTGAAGGAA ACACAGTAGT C AAAGG AG CT TCTTCGAGTC AGAC CACTAC C AG AAATT CT 1080 

ATTAACATCG G AAAC AC CAA TGCTAAGATT GTACAGCTGC GAGCCTCTCA AGGCAATACT 114 0 

ATCTACTTCT ATGATCCTAT AACAACTAAC CATACTGCAG CTCTCTCAGA TGCTCTAAAC 1200 

TTAAATGGTC CTGACCTTGC AGGGAATCCT GCATAT CAAG G AAC CAT CGT ATTTTCTGGA 12 60 

GAGAAGCTCT CGGAAGCAGA AGCTGCAGAA GCTGATAATC TCAAATCTAC AATTCAGCAA 1320 

CCTCTAACTC TTGCGGGAGG GCAACTCTCT CTTAAATCAG GAGTCACTCT AGTTG CTAAG 13 80 

TCCTTTTCGC AATCTCCGGG CTCTACCCTC CTCATGGATG CAGGGACCAC ATTAGAAACC 14 4 0 

GCTGATGGGA T CACT AT CAA TAATCTTGTT CTCAATGTAG ATTC CTT AAA AGAGACCAAG 1500 

AAGGCTACGC TAAAAGCAAC ACAAGCAAGT CAGACAGTCA CTTTATCTGG ATCGCTCTCT 1560 

CTTGT AG AT C CTTCTGGAAA TGTCTACGAA GATGTCTCTT GGAATAACCC TCAAGTCTTT 162 0 

TCTTGTCTCA CTCTTACTGC TGACGACCCC GCGAATATTC ACATCACAGA CTTAGCTGCT 1680 

GATCCCCTAG AAAAAAATCC TATCCATTGG GGATACCAAG GGAATTGGGC ATTAT CTTGG 174 0 

CAAGAGGATA CTGCGACTAA AT C CAAAG C A GCGACTCTTA C CTGG AC AAA AA CAGG AT AC 1800 

AATCCGAATC CTGAGCGTCG TGGAACCTTA GTTGCTAACA CGCTATGGGG AT CCTTTGTT 1860 

GATGTGCGCT CCATACAACA GCTTGTAGCC ACTAAAGTAC GCCAATCTCA AGAAACTCGC 192 0 

GGCATCTGGT GTGAAGGGAT CTCGAACTTC TTC CAT AAAG ATAGCACGAA GATAAATAAA 19 80 

GGTTTTCGCC ACATAAGTGC AGGTTATGTT GTAGGAGCGA CTACAACATT AG CTT CTG AT 2 04 0 

AATCTTATCA CTGCAGCCTT CTGCCAATTA TTCGGGAAAG ATAGAGATCA CTTTATAAAT 2100 

AAAAATAGAG CTTCTGCCTA TG CAG CTT CT CTCCATCTCC AGCATCTAGC G ACCTTGT CT 2160 

TCTCCAAGCT TGTTACGCTA CCTTCCTGGA T CTG AAAGTG AGCAGCCTGT CCTCTTTGAT 2 220 

GCTCAGATCA G CTAT AT CTA TAGTAAAAAT ACTATGAAAA C CT ATT AC AC CCAAGCACCA 2 2 80 

AAGGGAGAGA GCTCGTGGTA TAATGACGGT TGCGCTCTGG AACTTG CGAG CTCCCTACCA 2 34 0 

CACACTGCTT TAAG CCATGA GGGTCTCTTC CACGCGTATT TT C CTTT CAT CAAAGTAGAA 2 400 

GCTTCGTACA TACACCAAGA TAGCTTCAAA GAACGTAATA CTACCTTGGT ACGAT CTTT C 2 460 

GATAGCGGTG ATTTAATTAA CGTCTCTGTG CCTATTGGAA TTACCTTCGA GAGATTCTCG 2 520 

AGAAACG AG C GTGCGTCTTA CGAAGCTACT GTCATCTACG TTGCCGATGT CTATCGTAAG 2 580 * 

AATCCTGACT GCACGACAGC TCTCCTAATC AACAATACCT CGTGGAAAAC TACAGGAACG 2 64 0 

AATCTCTCAA GACAAGCTGG TATCGGAAGA GCAGGGATCT TTTATGCCTT CTCTCCAAAT 2 70 0 

CTTGAGGTCA CAAGTAACCT ATCTATGGAA ATT CGTGG AT CTTCACGCAG CT ACAATG C A 2 760 

GAT CTTGG AG GTAAGTTCCA GTTCTAA 2 7 87 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



SUBSTITUTE SHEET (RULE 26) 



WO 98/58953 



60 



PCT/DK98/00266 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Met 


Lys 


Ser 


Ser 


Leu 


His 


Trp 


Phe 


Val 


He 


Ser 


Ser 


Ser 


Leu 


Ala 


Leu 


1 








5 










10 










15 




Pro 


Leu 


Ser 


Leu 


Asn 


Phe 


Ser 


Ala 


Phe 


Ala 


Ala 


Val 


Val 


Glu 


He 


Asn 








20 










25 










30 






Leu 


Gly 


Pro 


Thr 


Asn 


Ser 


Phe 


Ser 


Gly 


Pro 


Gly Thr 


Tvr 


Thr 


Pro 


Pro 






35 










40 










45 








Ala 


Gin 


Thr 


Thr 


Asn 


Ala 


Asp 


Gly Thr 


He 


Tyr 


Asn 


Leu 


Thr 


Gly 


Asp 




50 










55 










60 






Val 


Ser 


He 


Thr 


Asn 


Ala 


Gly 


Ser 


Pro 


Thr 


Ala 


Leu 


Thr 


Ala 


Ser 


Cys 


65 










70 










75 










80 


Phe 


Lys 


Glu 


Thr 


Thr 


Gly 


Asn 


Leu 


Ser 


Phe 


Gin 


Gly 


His 


Gly Tyr 


Gin 










85 










90 










95 




Phe 


Leu 


Leu 


Gin 


Asn 


He 


Asp 


Ala 


Gly Ala 


Asn 


Cys 


Thr 


Phe 


Thr 


Asn 








100 










105 










110 






Thr 


Ala 


Ala 


Asn 


Lys 


Leu 


Leu 


Ser 


Phe 


Ser 


Gly 


Phe 


Ser 


Tyr 


Leu 


Cpv 






115 










120 










125 






Leu 


lie 


Gin 


Thr 


Thr 


Asn 


Ala 


Thr 


Thr 


Gly Thr 


Gly Ala 


He 


Lys 






130 










135 










140 








Thr 


Gly 


Ala 


Cys 


Ser 


He 


Gin 


Ser 


Asn 


Tyr 


Ser 


Cys 


Tyr 


Phe 


Gly 


uin 


145 










150 










155 








1 £ C\ 
1DU 


Asn 


Phe 


Ser 


Asn 


Asp 


Asn 


Gly 


Gly Ala 


Leu 


Gin 


Gly 


Ser 


Ser 


He 


OCX 










165 










170 










175 




Leu 


Ser 


Leu 


Asn 


Pro 


Asn 


Leu 


i. iii. 


Phe 


Ala 


Lys 


Asn 


Lys 


Ala 


Thr 










180 










185 










190 






Lys 


Gly 


Gly 


Ala 


Leu 


Tyr 


Ser 


Thr 


Gly Gly 


He 


Thr 


ne 


Asn 


Asn 


i. 1 IX 






195 










200 










205 








Leu 


Asn 


Ser 


Ala 


Ser 


Phe 


Ser 


Glu 


Asn 


Thr 


Ala 


Ala 


Asn 


Asn 


Gly 


\j J. y 




210 










215 










220 






Ala 


He 


Tyr 


Thr 


Glu 


Ala 


Ser 


Ser 


Phe 


He 


Ser 


Ser 


Asn 


Lys 


Ala 


Tip 

J. X. c 


225 










230 










235 








24 0 


Ser 


Phe 


He 


Asn 


Asn 


Ser 


Val 


Thr 


Ala 


Thr 


Ser 


Ala 


Thr 


Gly 


Gly 


Ala 










245 










250 








255 




He 


Tyr 


Cys 


Ser 


Ser 


Thr 


Ser 


Ala 


Pro 


Lys 


Pro 


Val 


Leu 


Thr 


Leu 


Ser 








260 










265 










270 






Asp 


Asn 


Gly 


Glu 


Leu 


Asn 


Phe 


He 


Gly Asn 


Thr 


Ala 


He 


Thr 


Ser 


Gly 






275 










280 










285 






Gly Ala 


He 


Tyr 


Thr 


Asp 


Asn 


Leu 


Val 


Leu 


Ser 


Ser 


Gly 


Gly 


Pro 


Thr 




290 










295 










300 






Leu 


Phe 


Lys 


Asn 


Asn 


Ser 


Ala 


He 


Asp 


Thr 


Ala 


Ala 


Pro 


Leu 


Gly 


Gly 


305 










310 










315 








32 0 


Ala 


He 


Ala 


He 


Ala 


Asp 


Ser 


Gly 


Ser 


Leu 


Ser 


Leu 


Ser 


Ala 


Leu 


Glv 










325 










330 










335 


Gly Asp 


He 


Thr 


Phe 


Glu 


Gly 


Asn 


Thr 


Val 


Val 


Lys 


Gly 


Ala 


Ser 


Ser 








340 










345 










350 






Ser 


Gin 


Thr 


Thr 


Thr 


Arg 


Asn 


Ser 


He 


Asn 


He 


Gly 


Asn 


Thr 


Asn 


Ala 






355 










360 








365 








Lys 


He 


Val 


Gin 


Leu 


Arg 


Ala 


Ser 


Gin 


Gly Asn Thr 


He 


Tyr 


Phe 


Tyr 




370 










375 










380 








Asp 


Pro 


He 


Thr 


Thr 


Asn 


His 


Thr 


Ala 


Ala 


Leu 


Ser 


Asp 


Ala 


Leu 


Asn 


385 










390 










395 








400 


Leu 


Asn 


Gly 


Pro 


Asp 


Leu 


Ala 


Gly 


Asn 


Pro 


Ala 


Tyr 


Gin 


Gly Thr 


He 


Val 








405 










410 










415 




Phe 


Ser 


Gly 


Glu 


Lys 


Leu 


Ser 


Glu 


Ala 


Glu 


Ala 


Ala 


Glu 


Ala 


Asp 








420 










425 










430 




Asn 


Leu 


Lys 


Ser 


Thr 


He 


Gin 


Gin 


Pro 


Leu 


Thr 


Leu 


Ala 


Gly 


Gly 


Gin 
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435 440 445 

Leu Ser Leu Lys Ser Gly Val Thr Leu Val Ala Lys Ser Phe Ser Gin 

450 455 460 

Ser Pro Gly Ser Thr Leu Leu Met Asp .Ala Gly Thr Thr Leu Glu Thr 
465 470 475 480 

Ala Asp Gly lie Thr lie Asn Asn Leu Val Leu Asn Val Asp Ser Leu 

485 490 495 

Lys Glu Thr Lys Lys Ala Thr Leu Lys Ala Thr Gin Ala Ser Gin Thr 

500 505 510 

Val Thr Leu Ser Gly Ser Leu Ser Leu Val Asp Pro Ser Gly Asn Val 

515 520 525 

Tyr Glu Asp Val Ser Trp Asn Asn Pro Gin Val Phe Ser Cys Leu Thr 

530 535 540 

Leu Thr Ala Asp Asp Pro Ala Asn He His He Thr Asp Leu Ala Ala 
545 550 555 560 

Asp Pro Leu Glu Lys Asn Pro He His Trp Gly Tyr Gin Gly Asn Trp 

565 570 575 

Ala Leu Ser Trp Gin Glu Asp Thr Ala Thr Lys Ser Lys Ala Ala Thr 

580 585 590 

Leu Thr Trp Thr Lys Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Gly 

595 600 605 

Thr Leu Val Ala Asn Thr Leu Trp Gly Ser Phe Val Asp Val Arg Ser 

610 615 620 

He Gin Gin Leu Val Ala Thr Lys Val Arg Gin Ser Gin Glu Thr Arg 
625 630 635 640 

Gly He Trp Cys Glu Gly He Ser Asn Phe Phe His Lys Asp Ser Thr 

645 650 655 

Lys* He Asn Lys Gly Phe Arg His He Ser Ala Gly Tyr Val Val Gly 

660 665 670 

Ala Thr Thr Thr Leu Ala Ser Asp Asn Leu He Thr Ala Ala Phe Cys 

675 680 685 

Gin Leu Phe Gly Lys Asp Arg Asp His Phe He Asn Lys Asn Arg Ala 

690 695 700 

Ser Ala Tyr Ala Ala Ser Leu His Leu Gin His Leu Ala Thr Leu Ser 
705 710 " 715 720 

Ser Pro Ser Leu Leu Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro 

725 730 735 

Val Leu Phe Asp Ala Gin He Ser Tyr He Tyr Ser Lys Asn Thr Met 

740 745 750 

Lys Thr Tyr Tyr Thr Gin Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn 

755 760 765 

Asp Gly Cys Ala Leu Glu Leu Ala Ser Ser Leu Pro His Thr Ala Leu 

770 775 780 

Ser His Glu Gly Leu Phe His Ala Tyr Phe Pro Phe He Lys Val Glu 
785 790 795 800 

Ala Ser Tyr He His Gin Asp Ser Phe Lys Glu Arg Asn Thr Thr Leu 

805 810 815 

Val Arg Ser Phe Asp Ser Gly Asp Leu He Asn Val Ser Val Pro He 

820 825 830 

Gly He Thr Phe Glu Arg Phe Ser Arg Asn Glu Arg Ala Ser Tyr Glu 

835 840 845 

Ala Thr Val He Tyr Val Ala Asp Val Tyr Arg Lys Asn Pro Asp Cys 

850 855 860 

Thr Thr Ala Leu Leu He Asn Asn Thr Ser Trp Lys Thr Thr Gly Thr 
865 870 875 880 

Asn Leu Ser Arg Gin Ala Gly He Gly Arg Ala Gly He Phe Tyr Ala 
885 890 895 
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Phe Ser Pro Asn Leu Glu Val Thr Ser 
900 905 
Gly Ser' Ser Arg Ser Tyr Asn Ala Asp 



Asn Leu Ser Met Glu lie Arg 
910 



Leu Gly Gly Lys Phe Gin Phe 
925 



915 920 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2793 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATGAAAATAC CCTTGCACAA ACTCCTGATC TCTTCGACTC TTGTCACTCC CATTCTATTG 6 0 

AGCATTGCAA CTTACGG AG C AGATG CTTCT TTATCCCCTA C AG AT AG CTT TGATGGAGCG 120 

GGCGGCTCTA CATTTACTCC AAAATCTACA GCAGATGCCA ATGGAACGAA CTATGTCTTA 180 

TCAGGAAATG TCTATATAAA CGATGCTGGG AAAGG C AC AG CATTAACAGG CTGCTGCTTT 24 0 

ACAGAAACTA CGGGTGATCT GACATTTACT GGAAAGGGAT ACTCATTTTC ATTCAACACG 3 00 

GT AGATG CG G GTTCGAATGC AGGAGCTGCG GCAAGCACAA CTGCTGATAA AGCCCTAACA 3 60 

TT C AC AGG AT TTTCTAACCT TT C CTTCATT GCAGCTCCTG GAACTACAGT TGCTTCAGGA 4 20 

AAAAGTACTT TAAGTTCTGC AGGAGCCTTA AATCTTACCG ATAATGGAAC GATTCTCTTT 4 80 

AGCCAAAACG TCTCCAATGA AGCTAATAAC AATGGCGGAG CGATCACCAC AAAAACTCTT 54 0 

TCTATTTCTG GGAATACCTC TTCTATAACC TTCACTAGTA AT AG CG C AAA AAAATTAGGT 600 

GGAGCGATCT ATAGCTCTGC GGCTGCAAGT ATTTCAGGAA ACACCGGCCA GTTAGTCTTT 6 60 

ATGAATAATA AAGGAGAAAC TGGGGGCGGG GCTCTGGGCT TTGAAGCCAG CTCCTCGATT 72 0 

ACTCAAAATA GCTCCCTTTT CTTCTCTGGA AACACTGCAA CAGATGCTGC AGGCAAGGGC 7 80 

GGGGCCATTT ATTGTGAAAA AACAGGAGAG ACTCCTACTC TT ACT AT CT C TGGAAATAAA 84 0 

AGTCTGACCT TCGCCGAGAA CTCTTCAGTA ACTCAAGGCG GAGCAATCTG TGCCCATGGT 900 

CT AG AT CTTT CCGCTGCTGG CCCTACCCTA TTTTCAAATA ATAGATGCGG GAACACAGCT 96 0 

GCAGGCAAGG GCGGCGCTAT TGCAATTGCC GACTCTGGAT CTTTAAGTCT CTCTGCAAAT 102 0 

CAAGGAGACA TCACGTTCCT TGGCAACACT CTAACCTCAA CCTCCGCGCC AACATCGACA 10 80 

CGGAATGCTA TCTACCTGGG ATCGTCAGCA AAAATTACGA ACTTAAGGGC AGCCCAAGGC 114 0 

CAATCTATCT ATTTCTATGA TCCGATTGCA TCTAACACCA CAGGAGCTTC AGACGTTCTG 12 0 0 

ACCATCAACC AACCGGATAG CAACTCGCCT TTAGATTATT CAGGAACGAT TGTATTTTCT 12 6 0 

GGGGAAAAGC T CTCTG C AG A TGAAGCGAAA GCTGCTGATA ACTTCACATC TATATTAAAG 132 0 

CAACCATTGG CTCTAGCCTC TGGAACCTTA GCACTCAAAG G AAATGT CG A GTTAGATGTC 13 8 0 

AATGGTTT C A CACAGACTGA AGGCTCTACA CTCCTCATGC AACCAGGAAC AAAGCTCAAA 14 4 0 

GCAGATACTG AAGCTATCAG TCTTACCAAA CTTGTCGTTG ATCTTTCTGC CTTAGAGGGA 150 0 

AATAAGAGTG TGTCCATTGA AAC AG C AGG A GCCAACAAAA CTATAACTCT AACCTCTCCT 156 0 - 

CTTGTTTTCC AAGATAGTAG CGGCAATTTT TATGAAAGCC ATACGATAAA CCAAGCCTTC 162 0 

ACGCAGCCTT TGGTGGTATT CACTGCTGCT ACTGCTG CT A G CGATATTT A TATCGATGCG 16 80 

CTTCTCACTT CTCCAGTACA AACTCCAGAA CCTCATTACG GGTATCAGGG ACATTGGGAA 174 0 

GCCACTTGGG CAGACACATC AACTG CAAAA TCAGGAACTA TGACTTGGGT AACTACGGGC 180 0 

TACAACCCTA ATCCTGAGCG TAGAGCTTCC GTAGTTCCCG ATTCATTATG GGCATCCTTT 1860 

ACTGACATTC G C ACTCT AC A GCAGATCATG ACATCT CAAG CGAATAGTAT CTATCAGCAA 192 0 

CGAGGACTCT GGGCATCAGG AACTG CGAAT TTCTTCCATA AGG AT AAAT C AGGAACTAAC 1980 

CAAG C ATT CC GACATAAAAG CTACGGCTAT ATTGTTGGAG GAAGTGCTGA AG ATTTTT CT 2 04 0 

G AAAAT AT CT TCAGTGTAGC TTTCTGCCAG CTCTTCGGTA AAGATAAAGA C CTGTTT AT A 210 0 

GTTGAAAATA CCTCTCATAA CTATTTAGCG T CG CT AT AC C TGCAACATCG AGCATTCCTA 216 0 

GGAGGACTTC CCATGCCCTC ATTTGGAAGT ATCACCGACA TGCTGAAAGA TATTCCTCTC 2220 

ATTTTGAATG CCCAGCTAAG CTACAGCTAC ACTAAAAATG ATATGGATAC T CGCT AT ACT 2280 

TCCTATCCTG AAGCTCAAGG TTCTTGGACC AATAATTCTG GGGCTCTAGA GCTCGGAGGA 2 34 0 

TCTCTGGCTC TATATCTCCC TAAAGAAGCA CCGTTCTTCC AGGGATATTT CCCCTTCTTA 2 4 00 
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AAGTTCCAGG CAGTCTACAG CCGCCAACAA AACTTTAAAG AGAGTGGCGC TGAAGCCCGT 246 0 

GCTTTTGATG ATGGAGACCT AGTGAACTGC TCTATCCCTG TCGGCATTCG GTTAGAAAAA 2 52 0 

ATCTCCGAAG ATGAAAAAAA TAATTT CG AG ATTTCTCTAG CCAACATTGG TGATGTGTAT 2 58 0 

CGTAAAAATC CCCGTTCGCG TACTTCTCTA ATGGTCAGTG GAGCCTCTTG GACTTCG CTA 2 64 0 

TGTAAAAACC TCGCACGACA AGCCTTCTTA GCAAGTGCTG GAAGCCATCT GACTCTCTCC 2 70 0 

CCTCATGTAG AACT CTCTGG GGAAGCTGCT TATGAGCTTC GTGGCTCAGC AC AC AT CT AC 2760 

AATGTAGATT GTGGGCTAAG ATACTCATTC TAG 2 79 3 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Met 


Ly s 


Tip 

x -L e 


Pro 


Leu 


His 


Lys 


Leu 


Leu 


He 


Ser 


Ser 


Thr 


Leu 


Val 


Thr 


1 


He 






c 
-> 










10 










15 




Pro 


Leu 


JjSU 

20 




lie 


AI8 


l nr 


Tyr 
4, b 


Gly 


Ala 


Asp 


Ala 


Ser 
30 


Leu 


Ser 


Pro 


Thr 


Asp 


Ser 


rut. 


Asp 


n 1 it 


Ala 


Gly 


Gly 


Ser 


Thr 


Phe 


Thr 


Pro 


Lys 






3 5 ■ 










*± U 










45 






Ser 


Thr 


Ala 




AT ^ 


Asn 


pi 
uiy 


i nr 


Asn 


Tyr 


Val 


Leu 


Ser 


Gly 


Asn 


Val 




50 




















60 








Tyr 


He 


Asn 


Ren 


AT a 

r\J- cL 


uiy 


Lys 


(jly 


Thr 


Ala 


Leu 


Thr 


Gly 


Cys 


Cys 


Phe 


65 










70 










7 ^ 




8 0 


Thr 


Glu 


Thr 


Thr 


Gly 


Asp 


Leu 


Thr 


Phe 


Thr 


Gly 


Lys 


Gly Tyr 


Ser 


Phe 










85 










90 










95 




Ser 


Phe 


Asn 


Thr 


Val 


Asp 


Ala 


Gly 


Ser 


Asn 


Ala 


Gly 


Ala 


Ala 


Ala 


Ser 


Thr 






100 










105 










110 






Thr 


Ala 
115 


Asp 


Lys 


Ala 


Leu 


Thr 
120 


Phe 


Thr 


Gly 


Phe 


Ser 
125 


Asn 


Leu 


Ser 


Phe 


He 


Ala 


Ala 


Pro 


Gly 


Thr 


Thr 


Val 


Ala 


Ser 


Gly 


Lys 


Ser 


Thr 


Leu 




130 










135 










140 








Ser 


Ser 


Ala 


Gly 


Ala 


Leu 


Asn 


Leu 


Thr 


Asp 


Asn 


Gly Thr 


He 


Leu 


Phe 


145 










150 










155 










160 


Ser 


Gin 


Asn 


Val 


Ser 


Asn 


Glu 


Ala 


Asn 


Asn 


Asn 


Gly Gly Ala 


He 


Thr 










165 










170 










175 




Thr 


Lys 


Thr 


Leu 


Ser 


He 


Ser 


Gly 


Asn 


Thr 


Ser 


Ser 


He 


Thr 


Phe 


Thr 


Ser 






180 










185 










190 






Asn 


Ser 


Ala 


Lys 


Lys 


Leu 


Gly 


Gly 


Ala 


He 


Tyr 


Ser 


Ser 


Ala 


Ala 


Ala 




195 










200 










205 








Ser 


He 


Ser 


Gly 


Asn 


Thr 


Gly 


Gin 


Leu 


Val 


Phe 


Met 


Asn 


Asn 


Lys 




210 










215 










220 








Gly 


Glu 


Thr 


Gly 


Gly 


Gly 


Ala 


Leu 


Gly 


Phe 


Glu 


Ala 


Ser 


Ser 


Ser 


He 


225 










230 










235 










240 


Thr 


Gin 


Asn 


Ser 


Ser 


Leu 


Phe 


Phe 


Ser 


Gly 


Asn 


Thr 


Ala 


Thr 


Asp 


Ala 


Ala 








245 










250 










255 




Gly 


Lys 


Gly 


Gly 


Ala 


He 


Tyr 


Cys 


Glu 


Lys 


Thr 


Gly Glu 


Thr 


Pro 


Thr 






260 










265 










270 






Leu 


Thr 


He 


Ser 


Gly 


Asn 


Lys 


Ser 


Leu 


Thr 


Phe 


Ala 


Glu 


Asn 


Ser 


Ser 




275 










230 










285 








Val 


Thr 


Gin 


Gly 


Gly 


Ala 


He 


Cys 


Ala 


His 


Gly 


Leu 


Asp 


Leu 


Ser 
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290 295 300 

Ala Ala Gly Pro Thr Leu Phe Ser Asn Asn Arg Cys Gly Asn Thr Ala 
305 310 315 320 

Ala Gly Lys Gly Gly Ala lie Ala lie Ala Asp Ser Gly Ser Leu Ser 

325 330 335 

Leu Ser Ala Asn Gin Gly Asp lie Thr Phe Leu Gly Asn Thr Leu Thr 

340 345 350 

Ser Thr Ser Ala Pro Thr Ser Thr Arg Asn Ala lie Tyr Leu Gly Ser 

355 360 365 

Ser Ala Lys He Thr Asn Leu Arg Ala Ala Gin Gly Gin Ser He Tyr 

370 375 380 

Phe Tyr Asp Pro He Ala Ser Asn Thr Thr Gly Ala Ser Asp Val Leu 
385 390 395 400 

Thr He Asn Gin Pro Asp Ser Asn Ser Pro Leu Asp Tyr Ser Gly Thr 

405 410 415 

He Val Phe Ser Gly Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala 

420 425 430 

Asp Asn Phe Thr Ser He Leu Lys Gin Pro Leu Ala Leu Ala Ser Gly 

435 440 445 

Thr Leu Ala Leu Lys Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr 

450 455 460 

Gin Thr Glu Gly Ser Thr Leu Leu Met Gin Pro Gly Thr Lys Leu Lys 
465 470 475 480 

Ala Asp Thr Glu Ala He Ser Leu Thr Lys Leu Val Val Asp Leu Ser 

485 490 495 

Ala Leu Glu Gly Asn Lys Ser Val Ser He Glu Thr Ala Gly Ala Asn 

500 505 510 

Lys Thr He Thr Leu Thr Ser Pro Leu Val Phe Gin Asp Ser Ser Gly 

515 520 525 

Asn Phe Tyr Glu Ser His Thr He Asn Gin Ala Phe Thr Gin Pro Leu 

530 535 540 

Val Val Phe Thr Ala Ala Thr Ala Ala Ser Asp He Tyr He Asp Ala 
545 550 555 560 

Leu Leu Thr Ser Pro Val Gin Thr Pro Glu Pro His Tyr Gly Tyr Gin 

565 570 575 

Gly His Trp Glu Ala Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly 

580 585 590 

Thr Met Thr Trp Val Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg 

595 600 605 

Ala Ser Val Val Pro Asp Ser Leu Trp Ala Ser Phe Thr Asp He Arg 

610 615 620 

Thr Leu Gin Gin He Met Thr Ser Gin Ala Asn Ser He Tyr Gin Gin 
625 630 635 640 

Arg Gly Leu Trp Ala Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys 

645 650 655 

Ser Gly Thr Asn Gin Ala Phe Arg His Lys Ser Tyr Gly Tyr He Val 

660 665 670 

Gly Gly Ser Ala Glu Asp Phe Ser Glu Asn He Phe Ser Val Ala Phe 

675 680 685 

Cys Gin Leu Phe Gly Lys Asp Lys Asp Leu Phe He Val Glu Asn Thr 

690 695 700 

Ser His Asn Tyr Leu Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu 
705 710 715 720 

Gly Gly Leu Pro Met Pro Ser Phe Gly Ser He Thr Asp Met Leu Lys 

725 730 735 

Asp He Pro Leu lie Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys 
740 745 750 
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Asn Asp Met Asp Thr Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser 

755 760 765 

Trp Thr Asn Asn Ser Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu 

770 775 780 

Tyr Leu Pro Lys Glu Ala Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu 
785 790 795 800 

Lys Phe Gin Ala Val Tyr Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly 

805 810 815 

Ala Glu Ala Arg Ala Phe Asp Asp Gly Asp Leu Val Asn Cys Ser He 

820 825 830 

Pro Val Gly He Arg Leu Glu Lys He Ser Glu Asp Glu Lys Asn Asn 

835 840 845 

Phe Glu lie Ser Leu Ala Asn He Gly Asp Val Tvr Arg Lvs Asn Pro 

850 855 860 

Arg Ser Arg Thr Ser Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu 
865 870 875 880 

Cys Lys Asn Leu Ala Arg Gin Ala Phe Leu Ala Ser Ala Gly Ser His 

"5 890 895 

Leu Thr Leu Ser Pro His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu 

900 905 910 

Leu Arg Gly Ser Ala His He Tyr Asn Val Asp Cys Gly Leu Arg Tyr 
915 920 925 

Ser Phe 
930 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



GAAGACAATA TAAGGTACCG TCATAACAGC GGGGGTTATG CACTAGGGAT CACAGCAACA 60 

ACTCCTGCCG AGGATCAGCT TACTTTTGCC TTCTGCCAGC T CTTTG CT AG AGATCGCAAT 120 

CATATTACAG GTAAGAACCA CGGAGATACT TACGGTGCCT CTTTGTATTT CCACCATACA 180 

GAAGGGCTCT TCGACATCGC CAATTTCCTC TGGGGAAAAG CAACCCGAGC TCCCTGGGTG 24 0 

CTCTCTGAGA TCTCCCAGAT CATTCCTTTA TCGTTCGATG CTAAATTCAG TTATCTCCAT 300 

ACAGACAACC ACATGAAGAC AT ATT AT AC C GATAACTCTA TCATCAAGGG TTCTTGGAGA 3 60 

AACGATGCCT T CTGTGC AG A TCTTGGAGCT AGCCTGCCTT TTGTTATTTC CGTTCCGTAT 42 0 

CTTCTGAAAG AAGTCGAACC TTTTGTCAAA GTACAGTATA TCTATGCGCA TCAGCAAGAC 4 80 

TTCTACGAGC GTCATGCTGA AGGACGCGCT TTCAATAAAA GCGAGCTTAT CAACGTAGAG 54 0 

ATTCCTATAG GCGTCACCTT CGAAAGAGAC T CAAAAT C AG AAAAGGGAAC TTACGATCTT 600 

ACTCTTATGT ATATACTCGA TGCTTACCGA CGCAATCCTA AATGT CAAAC TTCCCTAATA 6 60 

GCTAGCGATG CTAACTGGAT GGCCTATGGT ACCAACCTCG CACGACAAGG TTTTT CTGTT 72 0 

CGTGCTGCGA ACCATTTCCA AGTGAACCCC CACATGGAAA TCTTCGGTCA ATTCGCTTTT 780 

GAAGTACGAA GTTCTTCACG AAATTATAAT ACAAACCTAG G CT CTAAGTT TTGTTTCTAG 84 0 

(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 amino acids 

(B) TYPE: amino acid 
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(C) STRAND EDN ESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 18: 



Glu 


Asp 


Asn 


He 


Arg 


Tyr 


Arg 


His 


Asn 


Ser 


Gly 


Gly 


Tyr 


Ala 


Leu 


Gly 


1 
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He 


Thr 


Ala 


Thr 


Thr 


Pro 


Ala 


Glu 


Asp 


Gin 


Leu 


Thr 


Phe 


Ala 


Phe 


Cvs 








20 










25 
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Gin 


Leu 


Phe 


Ala 


Arg 


Asp 


Arg 


Asn 


His 


He 


Thr 


Gly 


Lys 


Asn 


His 


Glv 






35 










40 










45 






Asp 


Thr 


Tyr 


Gly 


Ala 


Ser 


Leu 


Tyr 


Phe 


His 


His 


Thr 


Glu 


Glv 


Leu 


Phe 




50 










55 










60 








Asp 


He 


Ala 


Asn 


Phe 


Leu 


Trp 


Gly 


Lys 


Ala 


Thr 


Arcr 


Ala 


Pro 


T rn 


Val 


65 










70 










75 








a n 

o w 


Leu 


Ser 


Glu 


He 


Ser 
85 


Gin 


He 


He 


Pro 


Leu 
90 


Ser 


Phe 


Asp 


Ala 


Lys 


p"h o 


Ser 


Tyr 


Leu 


His 


Thr 


Asp 


Asn 


His 


Met 


Lvs 


Thr 


Tvr 


Tvr 


Thr 


Asp 


Asn 








100 










105 
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Ser 


He 


He 


Lys 


Gly 


Ser 


Trp 


Arg 


Asn 


Asn 

r 


Ala 


Phe 


Cvs 


Ala 


Asp 


L6U 






115 










120 










125 






Gly 


Ala 


Ser 


Leu 


Pro 


Phe 


Val 


He 


Ser 


Val 


Pro 


Tyr 


Leu 


Leu 




Glu 




130 










135 










14 0 








Val 


Glu 


Pro 


Phe 


Val 


Lys 


Val 


Gin 


Tyr 


He 


Tvr 


Ala 


His 


Gin 




Asp 


145 










150 










155 










i cn 


Phe 


Tyr 


Glu 


Arg 


His 


Ala 


Glu 


Gly 


Arg 


Ala 


Phe 


Asn 


Lys 


Ser 


Glu 


Leu 










165 










170 








175 




He 


Asn 


Val 


Glu 


He 


Pro 


He 


Gly 


Val 


Thr 


Phe 


Glu 


Arg 


Asp 


Ser 


Lys 








180 










185 










190 




Ser 


Glu 


Lys 


Gly 


Thr 


Tyr 


Asp 


Leu 


Thr 


Leu 


Met 


Tyr 


He 


Leu 


Asp 


Ala 
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200 










205 






Tyr 


Arg 


Arg 


Asn 


Pro 


Lys 


Cys 


Gin 


Thr 


Ser 


Leu 


He 


Ala 


Ser 


Asp 


Ala 
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215 










220 








Asn 


Trp 


Met 


Ala 


Tyr 


Gly 


Thr 


Asn 


Leu 


Ala 


Arg 


Gin 


Gly 


Phe 


Ser 


Val 
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230 










235 










240 


Arg 


Ala 


Ala 


Asn 


His 


Phe 


Gin 


Val 


Asn 


Pro 


His 


Met 


Glu 


He 


Phe 


Gly 
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250 
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Gin 


Phe 


Ala 


' Phe 


Glu 


Val 


Arg 


Ser 


Ser 


Ser 


Arg 
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Tyr 


Asn 


Thr 


Asn 
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Gly 
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Lys 


Phe 


Cys 


Phe 





















(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ATGACCATAC TTCGAAATTT TCTTACCTGC TCGGCTTTAT TCCTCGCTCT CCCTGCAGCA 
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GCACAAGTTG TATATCTTCA TGAAAGTGAT GGTTATAACG GTGCTATCAA TAATAAAAGC 120 

TTAGAACCTA AAATTACCTG TTATCCAGAA GGAACTTCTT A CAT CTTT CT AGATGACGTG 180 

AGGATTTCCA ACGTTAAGCA TGATCAAGAA GATGCTGGGG TTTTTATAAA TCGATCTGGG 24 0 

AAT CTTTTTT TCATGGGCAA CCGTTGCAAC TTCACTTTTC ACAACCTTAT GACCGAGGGT 3 00 

TTTGGCGCTG C CATTTCGAA CCGCGTTGGA GACACCACTC TCACTCTCTC T AATTTTT CT 3 60 

TACTTAACGT TCACCTCAGC ACCTCTACTA CCTCAAGGAC AAGG AG CGAT TTATAGTCTT 42 0 

GGTTCCGTGA TGATCGAAAA TAGTGAGGAA GTGACTTTCT GTGGGAACTA CTCTTCGTGG 4 80 

AGTGG AG CTG CGATTTATAC TCCCTACCTT TTAGGTTCTA AGGCGAGTCG TCCTTCAGTA 54 0 

AATCTCAGCG GGAACCGCTA CCTGGTGTTT AGAG ACT AT G TGAGCCAAGG TTATGGCGGC 600 

GCCGTATCTA CCCACAATCT CACACTCACG ACT CGAGGAC CTTCGTGTTT TGAAAATAAT 66 0 

CATGCTTATC ATGACGTGAA TAGTAATGGA GGAGCCATTG CCATTGCTCC TGGAGGATCG 72 0 

ATCTCTATAT C CGTG AAAAG CGGAGATCTC ATCTTCAAAG G AAATAC AG C AT CAC AAG AC 7 80 

GGAAATACAA TACACAACTC CATCCATCTG CAATCTGGAG CACAGTTTAA GAACCTACGT 84 0 

GCTGTTTCAG AATCCGGAGT TTATTTCTAT GAT C CT AT AA GCCATAGCGA GTCGCATAAA 900 

ATTACAGATC TTGTAATCAA TGCTCCTGAA GGAAAGGAAA CTTATGAAGG AACAATTAGC 960 

TTCT CAGG AC TATGCCTGGA TGATCATGAA GTTTGTGCGG AAAATCTTAC TTCCACAATC 102 0 

CTACAAGATG T CAC ATT AG C AGGAGGAACT CT C T CT CT AT CGGATGGGGT TACCTTGCAA 10 80 

CTG CATTCTT TT AAG CAGG A AG C AAG CT CT ACGCTTACTA TGTCTCCAGG AACCACTCTG 114 0 

CTCTGCTCAG GAGATGCTCG GGTTCAGAAT CTGCACATCC TGATTGAAGA TACCGACAAC 1200 

TTTGTT CCTG TAAGGATTCG CGCCGAGGAC AAGGATGCTC TTGTCTCATT AGAAAAACTT 12 60 

AAAGTTGCCT TTGAGGCTTA TTGGTCCGTC TATGACTTTC CTCAATTTAA GGAAGCCTTT 1320 

ACGATTCCTC TTCTTGAACT TCTAGGGCCT TCTTTTGACA GTCTTCTCCT AGGGGAGACC 13 80 

ACTTTGGAGA GAACCCAAGT CACAACAGAG AATGACGCCG TTCGAGGTTT CTGGTCCCTA 14 4 0 

AGCTGGGAAG AGTACCCCCC TTCTCTGGAT AAAGACAGAA GGATCACACC AACTAAGAAA 1500 

ACTGTTTT C C TCACTTGGAA T CCTG AG AT C ACTTCTACGC CATAA 154 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Thr lie Leu Arg Asn Phe Leu Thr Cys Ser Ala Leu Phe Leu Ala 

1 5 10 15 

Leu Pro Ala Ala Ala Gin Val Val Tyr Leu His Glu Ser Asp Gly Tyr 

20 25 30 

Asn Gly Ala lie Asn Asn Lys Ser Leu Glu Pro Lys lie Thr Cys Tyr 

35 40 45 

Pro Glu Gly Thr Ser Tyr He Phe Leu Asp Asp Val Arg He Ser Asn 

SO 55 60 

Val Lys His Asp Gin Glu Asp Ala Gly Val Phe He Asn Arg Ser Gly 
65 70 75 80 

Asn Leu Phe Phe Met Gly Asn Arg Cys Asn Phe Thr Phe His Asn Leu 

85 90 95 

Met Thr Glu Gly Phe Gly Ala Ala He Ser Asn Arg Val Gly Asp Thr 

100 105 110 

Thr Leu Thr Leu Ser Asn Phe Ser Tyr Leu Thr Phe Thr Ser Ala Pro 

115 120 125 

Leu Leu Pro Gin Gly Gin Gly Ala He Tyr Ser Leu Gly Ser Val Met 

130 135 140 

He Glu Asn Ser Glu Glu Val Thr Phe Cys Gly Asn Tyr Ser Ser Trp 
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145 "0 155 160 

Ser Gly Ala Ala He Tyr Thr Pro Tyr Leu Leu Gly Ser Lys Ala Ser 

165 170 175 

Arg Pro Ser Val Asn Leu Ser Gly Asn Arg Tyr Leu Val Phe Arg Asp 

180 185 190 

Tyr Val Ser Gin Gly Tyr Gly Gly Ala Val Ser Thr His Asn Leu Thr 

195 200 205 

Leu Thr Thr Arg Gly Pro Ser Cys Phe Glu Asn Asn His Ala Tyr His 

210 215 220 

Asp Val Asn Ser Asn Gly Gly Ala He Ala He Ala Pro Gly Gly Ser 
225 230 235 240 

He Ser He Ser Val Lys Ser Gly Asp Leu He Phe Lys Gly Asn Thr 

245 250 255 

Ala Ser Gin Asp Gly Asn Thr He His Asn Ser He His Leu Gin Ser 

260 265 270 

Gly Ala Gin Phe Lys Asn Leu Arg Ala Val Ser Glu Ser Gly Val Tyr 

275 280 285 

Phe Tyr Asp Pro He Ser His Ser Glu Ser His Lys lie Thr Asp Leu 

290 295 300 

Val He Asn Ala Pro Glu Gly Lys Glu Thr Tyr Glu Gly Thr He Ser 
305 310 315 320 

Phe Ser Gly Leu Cys Leu Asp Asp His Glu Val Cys Ala Glu Asn Leu 

325 330 335 

Thr Ser Thr He Leu Gin Asp Val Thr Leu Ala Gly Gly Thr Leu Ser 

340 345 350 

Leu Ser Asp Gly Val Thr Leu Gin Leu His Ser Phe Lys Gin Glu Ala 

355 360 365 

Ser Ser Thr Leu Thr Met Ser Pro Gly Thr Thr Leu Leu Cys Ser Gly 

370 375 380 

Asp Ala Arg Val Gin Asn Leu His He Leu He Glu Asp Thr Asp Asn 
385 390 395 400 

Phe Val Pro Val Arg He Arg Ala Glu Asp Lys Asp Ala Leu Val Ser 

405 410 415 

Leu Glu Lys Leu Lys Val Ala Phe Glu Ala Tyr Trp Ser Val Tyr Asp 

420 425 430 

Phe Pro Gin Phe Lys Glu Ala Phe Thr He Pro Leu Leu Glu Leu Leu 

435 440 445 

Gly Pro Ser Phe Asp Ser Leu Leu Leu Gly Glu Thr Thr Leu Glu Arg 

450 455 460 

Thr Gin Val Thr Thr Glu Asn Asp Ala Val Arg Gly Phe Trp Ser Leu 
465 470 475 480 

Ser Trp Glu Glu Tyr Pro Pro Ser Leu Asp Lys Asp Arg Arg He Thr 

485 490 495 

Pro Thr Lys Lys Thr Val Phe Leu Thr Trp Asn Pro Glu He Thr Ser 
500 505 510 

Thr Pro 



(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 

ATGAAAACGT CTATTCGTAA GTTCTTAATT TCTACCACAC TGGCGCCATG TTTTGCTTCA 6 0 

ACAG CGTTT A CTGTAGAAGT TATCATGCCT TCCGAGAACT TTGATGGATC GAGTGGGAAG 12 0 

ATTTTTCCTT ACACAACACT TTCTGATCCT AGAGGGACAC TCTGTATTTT TT CAGGGG AT 180 

CTCTACATTG CGAATCTTGA TAATG CCAT A TCCAGAACCT CTTCCAGTTG CTTTAGCAAT 24 0 

AGGGCGGGAG CACTACAAAT CTTAGGAAAA GGTGGGGTTT TCTCCTTCTT AAATATCCGT 300 

TCTTCAGCTG ACGGAGCCGC GATTAGTAGT GTAATCACCC AAAATCCTGA ACTATGTCCC 3 60 

TTGAGTTTTT CAGGATTTAG TCAGATGATC TTCGATAACT GTGAATCTTT GACTTCAGAT 420 

ACCTCAGCGA GT AATGT CAT ACCTCACGCA TCGGCGATTT ACGCTACAAC GCCCATGCTC 4 80 

TTTACAAACA ATGACTCCAT ACTATTCCAA TACAAC CGTT CTGCAGGATT TGGAGCTGCC 54 0 

ATTCGAGGCA CAAGCATCAC AATAGAAAAT ACGAAAAAGA GCCTTCTCTT TAATGGTAAT 6 00 

GGATCCATCT CTAATGGAGG GGCCCTCACG GGATCTGCAG CGATCAACCT CATCAACAAT 66 0 

AGCGCTCCTG TGATTTTCTC AACGAATGCT ACAGGGATCT ATGGTGGGGC TATTTACCTT 72 0 

ACCGGAGGAT CTATGCTCAC CTCTGGGAAC CTCTCAGGAG TCTTGTTCGT TTATAATAGC 780 
TCGCGCT 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDN ESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Lys Thr Ser He Arg Lys Phe Leu He Ser Thr Thr Leu Ala Pro 

1 5 10 15 

Cys Phe Ala Ser Thr Ala Phe Thr Val Glu Val He Met Pro Ser Glu 

20 25 30 

Asn Phe Asp Gly Ser Ser Gly Lys He Phe Pro Tyr Thr Thr Leu Ser 

35 40 45 

Asp Pro Arg Gly Thr Leu Cys He Phe Ser Gly Asp Leu Tyr He Ala 

50 55 60 

Asn Leu Asp Asn Ala He Ser Arg Thr Ser Ser Ser Cys Phe Ser Asn 
65 70 75 80 

Arg Ala Gly Ala Leu Gin He Leu Gly Lys Gly Gly Val Phe Ser Phe 

85 90 95 

Leu Asn He Arg Ser Ser Ala Asp Gly Ala Ala lie Ser Ser Val He 

100 105 110 

Thr Gin Asn Pro Glu Leu Cys Pro Leu Ser Phe Ser Gly Phe Ser Gin 

115 120 125 

Met He Phe Asp Asn Cys Glu Ser Leu Thr Ser Asp Thr Ser Ala Ser 

"0 135 140 

Asn Val He Pro His Ala Ser Ala He Tyr Ala Thr Thr Pro Met Leu 
145 150 155 160 

Phe Thr Asn Asn Asp Ser He Leu Phe Gin Tyr Asn Arg Ser Ala Gly 

165 170 175 

Phe Gly Ala Ala He Arg Gly Thr Ser He Thr He Glu Asn Thr Lys 

180 185 190 

Lys Ser Leu Leu Phe Asn Gly Asn Gly Ser He Ser Asn Gly Gly Ala 

195 200 205 

Leu Thr Gly Ser Ala Ala He Asn Leu He Asn Asn Ser Ala Pro Val 
210 215 220 
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lie Phe Ser Thr 
225 

Thr Gly Gly Ser 

Val Tyr Asn Ser 
260 



Asn Ala Thr Gly 
230 

Met Leu Thr Ser 
245 

Ser Arg 



lie Tyr Gly Gly 
235 

Gly Asn Leu Ser 
250 



Ala lie Tyr Leu 
240 

Gly Val Leu Phe 
255 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



ATGAAGACTT CAGTTTCTAT GTTGTTGGCC CTGCTTTGCT CGGGGGCTAG CTCTATTGTA 
CTCCATGCCG CAACCACTCC ACTAAATCCT GAAGATGGGT TTATTGGGGA GGGCAATACA 
AATACTTTTT CTCCGAAATC TACAACGGAT GCTGCAGGAA CTACCTACTC TCTCACAGGA 
GAGGTTCTGT TTATAGATCC GGGGAAAGGT GGTTCAATTA CAGGAACTTG CTTTGTAGAA 
ACTGCTGGCG ATCTTACATT TTTAGGTAAT GGAAATACCC TAAAGTTCCT GT CGGTAG AT 
GCAGGTGCTA AT AT CG CGGT TGCTCATGTA CAAGGAAGTA AGAATTTAAG CTTCACAGAT 

TTCCTTTCTC TGGTGATCAC AGAATCTCCA AAATCCGCTG TTAGTACAGG AAAAGGTAGC 42 0 

CTAGTCAGTT CAGGTGCAGT CCAACTGCAA GATATAAACA CTCTAGTTCT TACAAGCAAT 4 80 

GCCTCTGTCG AAGATGGTGG CGTGATTAAA GGAAACTCCT GCTTGATTCA GGGAATCAAA 54 0 

AATAGTGCGA TTTTTGGACA AAATACATCT TCGAAAAAAG GAGGGGCGAT CTCCACGACT 6 00 

CAAGGACTCA C CAT AG AG AA TAACTTAGGG ACGCTAAAGT TCAATGAAAA CAAAG CAGTG 6 60 
ACCTCAGGAG GCGCCTTAGA TTTAGGAG CC GCGTCTACAT TCACTGCGAA CCATGAGTTG 
ATATTTTCAC AAAATAAGAC TTCTGGGAAT GCTGCAAATG GCGGAGCCAT AAATTGCTCA 
GGCGACCTAA CATTTACTGA TAACACTTCT TTGTTACTTC AAGAAAATAG CACAATGCAG 

GATGGTGGAG CTTTGTGTAG CACAGGAACC ATAAGCATTA CCGGTAGTGA TTCTATCAAT 900 

GTGATAGGAA ATACTTCAGG ACAAAAAGGA GGAGCGATTT CTGCAGCTTC TCTCAAGATT 960 

TTGGGAGGGC AGGG AGGCGC TCTCTTTTCT AATAACGTAG TG ACT CATG C CACCCCTCTA 1020 

GGAGGTGCCA TTTTTATCAA CACAGGAGGA TCCTTGCAGC TCTTCACTCA AGGAGGGGAT 10 80 

ATCGTATTCG AGGGGAATCA GGTCACTACA ACAGCTCCAA ATGCTACCAC TAAGAGAAAT 114 0 

GTAATTCACC T CG AG AG CAC CGCGAAGTGG ACGGGACTTG CTGCAAGTCA AGGTAACGCT 12 0 0 

ATCTATTTCT ATG ATC C CAT TACCACCAAC GATACGGGAG CAAGCGATAA CTTACGTATC 1260 

AATGAGGTCA GTGCAAATCA AAAGCTCTCG GGATCTATAG TATTTTCTGG AGAGAGATTG 1320 

TCGACAGCAG AAGCTATAGC TGAAAATCTT ACTTCGAGGA TCAACCAGCC TGTCACTTTA 13 80 

GTAGAGGGGA GCTTAGAACT TAAACAGGGA GTGACCTTGA TCACACAAGG ATTCTCGCAG 1440 

GAGCCAGAAT CCACGCTTCT TTTGGATTTG GGGACCTCAT TACAAGCTTC TACAGAAGAT 150 0 

ATCGTCATCA CAAATTCATC TATAAATGCC GATACCATTT ACGGAAAGAA TCCAATCAAT 156 0 

ATTGTAG CTT CAGCAGCGAA TAAGAACATT ACCCTAACAG GAACCTTAGC ACTTGTAAAT 162 0 

GCAGATGGAG CTTTGTATGA GAACCATACC TTG CAAGACT CTCAAGATTA TAGCTTTGTA 1680 

AAGTTATCT C CAGGAGCGGG AGGGACTATA ATT ACT CAAG ATGCTTCTCA GAAGCTTCTT 174 0 

GAAGTAGCTC CTTCTAGACC ACATTATGGC TAT CAAGGAC ATTGGAATGT GCAAGTCATC 1800 

CCAGGAACGG GAACTCAACC GAGCCAGGCA AATTTAGAAT GGGTGCGGAC AGGATACCTT 1860 

CCGAATCCCG AACGG CAAGG ATTTTTAGTT CCCAATAGCC TGTGGGGTTC TTTTGTTGAT 1920 

CAGCGTGCTA TCCAAGAAAT CATGGTAAAT AGTAGCCAAA TCTTATGTCA GGAACGGGGA 1980 

GTCTGGGGAG CTGGAATTGC TAATTTCCTA CATAGAGATA AAATTAATGA GCACGGCTAT 2040 

CGCCATAGCG GTGT CGGTTA TCTTGTGGGA GTTGGCACTC ATGCTTTTTC TGATGCTACG 2100 

ATAAATGCGG CTTTTTGCCA GCTCTTCAGT AGAGATAAAG ACTACGTAGT AT CC AAAAAT 2160 

CATGGAACTA GCTACTCAGG GGTCGTATTT CTTGAGGATA CCCTAGAGTT TAGAAGTCCA 2 22 0 
CAGGGATTCT ATACTGATAG CTCCTCAGAA GCTTGCTGTA ACCAAGTCGT CACTATAGAT 



60 
120 
180 
240 
300 
360 



720 
780 
840 
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ATGCAGTTGT CTTACAGCCA TAGAAATAAT GATATGAAAA CCAAATACAC GACATATCCA 2 34 0 

GAAGCT CAGG GATCTTGGGC AAATGATGTT TTTGGT CTTG AGTTTGGAGC GACTACATAC 24 0 0 

TACTACCCTA ACAGTACTTT TTTATTTGAT TACTACTCTC CGTTTCTCAG GCTGCAGTGC 24 60 

ACCTATGCTC ACCAGGAAGA CTTCAAAGAG ACAGGAGGTG AGGTTCGTCA CTTTACT AG C 2 52 0 

GGAGATCTTT T CAATTT AG C AGTTCCTATT GGCGTGAAGT TTGAGAGATT TTCAGACTGT 2 580 

AAAAGGGGAT CTTATGAACT TACCCTTGCT TATGTTCCTG ATGTGATTCG CAAAGATCCC 2 64 0 

AAGAGCACGG CAACATTGGC TAGTGG AG CT ACGTGGAGCA CCCACGGAAA CAATCTCTCC 2 700 

AGACAAGGAT TACAACTGCG TTTAGGGAAC CACTGTCTCA TAAATCCTGG AATTGAGGTG 2 76 0 

TTCAGT CACG GAGCTATTGA ATTG CGGGGA TCCTCTCGTA ATTATAACAT CAATCTCGGG 2 82 0 

GGTAAATACC GATTTTAA 2 83 8 

(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Met 


Lys 


Thr 


Ser 


Val 


Ser 


Mo t- 


Leu 


Lieu 


Ala 


Leu 


Leu 


Cys 


Ser 


Gly 


Ala 










5 










10 










15 




Ser 


Ser 


He 


Val 


Leu 


His 


Al s 

r\JL d 


r\ JL a 


i nr 


i nr 


Pro 


Leu 


Asn 


Pro 


Glu 


Asp 








20 










25 










30 




Gly 


Phe 


He 


Gly 


Glu 


Gly 


Asn 


Thr 


Asn 


Thr 


Phe 


Ser 


Pro 


Lys 


Ser 


Thr 






35 










40 










45 






Thr 


Asp 


Ala 


Ala 


Gly Thr 


Thr 


Tyr 


Ser 


Leu 


Thr 


Gly 


Glu 


Val 


Leu 


Phe 




50 










55 










60 










He 


Asp 


Pro 


Gly 


Lys 


Gly 


Gly 


Ser 


He 


Thr 


Gly 


Thr 


Cys 


Phe 


Val 


Glu 


65 










70 










75 








80 


Thr 


Ala 


Gly 


Asp 


Leu 


Thr 


Phe 


Leu 


Gly 


Asn 


Gly Asn 


Thr 


Leu 


Lys 


Phe 










85 










90 










95 




Leu 


Ser 


Val 


Asp 


Ala 


Gly 


Ala 


Asn 


He 


Ala 


Val 


Ala 


His 


Val 


Gin 


Gly 








100 










105 










110 




Ser 


Lys 


Asn 


Leu 


Ser 


Phe 


Thr 


Asp 


Phe 


Leu 


Ser 


Leu 


Val 


He 


Thr 


Glu 






115 










120 










125 








Ser 


Pro 


Lys 


Ser 


Ala 


Val 


Ser 


Thr 


Gly 


Lys 


Gly 


Ser 


Leu 


Val 


Ser 


Ser 




130 










135 










140 










Gly 


Ala 


Val 


Gin 


Leu 


Gin 


Asp 


He 


Asn 


Thr 


Leu 


Val 


Leu 


Thr 


Ser 


Asn 


145 










150 










155 










160 


Ala 


Ser 


Val 


Glu 


Asp 


Gly 


Gly 


Val 


He 


Lys 


Gly Asn 


Ser 


Cys 


Leu 


He 










165 










170 










175 




Gin 


Gly 


He 


Lys 


Asn 


Ser 


Ala 


He 


Phe 


Gly 


Gin 


Asn 


Thr 


Ser 


Ser 


Lys 








180 










185 










190 




Lys 


Gly 


Gly Ala 


He 


Ser 


Thr 


Thr 


Gin 


Gly 


Leu 


Thr 


He 


Glu 


Asn 


Asn 






195 










200 










205 








Leu 


Gly 


Thr 


Leu 


Lys 


Phe 


Asn 


Glu 


Asn 


Lys 


Ala 


Val 


Thr 


Ser 


Gly 


Gly 




210 










215 










220 






Ala 


Leu 


Asp 


Leu 


Gly Ala 


Ala 


Ser 


Thr 


Phe 


Thr 


Ala 


Asn 


His 


Glu 


Leu 


225 










230 










235 










240 


He 


Phe 


Ser 


Gin 


Asn 


Lys 


Thr 


Ser 


Gly 


Asn 


Ala 


Ala 


Asn 


Gly Gly Ala 










245 










250 










255 




He 


Asn 


Cys 


Ser 


Gly Asp 


Leu 


Thr 


Phe 


Thr 


Asp 


Asn 


Thr 


Ser 


Leu 


Leu 
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Leu 


Gin 


Glu 


Asn 


Ser 


Thr 


Met 


Gin 


Asp 


Gly Gly Ala 


Leu 


Cys 


Ser 


Thr 






275 










280 










285 






Gly 


Thr 


Tie 


Ser 


He 


Thr 


Gly 


Ser 


Asp 


Ser 


He 


Asn 


Val 


He 


Gly 


Asn 




290 










295 










300 








Thr 


Ser 


Gly Gin 


Lys 


Gly 


Gly 


Ala 


He 


Ser 


Ala 


Ala 


Ser 


Leu 


Lys 


He 


305 










310 










315 








320 


Leu 


Gly 


Gly 


Gin 


Gly 


Gly Ala 


Leu 


Phe 


Ser 


Asn 


Asn 


Val 


Val 


Thr 


His 










325 










330 










335 




Ala 


Thr 


Pro 


Leu 


Gly Gly Ala 


He 


Phe 


He 


Asn 


Thr 


Gly 


Gly 


Ser 


Leu 








340 










345 








350 






Gin 


Leu 


Phe 


Thr 


Gin 


Gly Gly Asp 


He 


Val 


Phe 


Glu 


Gly 


Asn 


Gin 


Val 






355 










360 










365 








Thr 


Thr 
370 


Thr 


Ala 


Pro 


Asn 


Ala 
375 


Thr 


Thr 


Lys 


Arg 


Asn 
3 80 


Val 


He 


His 


Leu 


Glu 


Ser 


Thr 


Ala 


Lys 


Trp 


Thr 


Glv 


Leu 


Ala 


Ala 


Ser 


Gin 


Gly 


Asn 


Ala 


385 










3 90 










395 








400 


He 


Tyr 


Phe 


Tyr 


Asp 


Pro 


He 


Thr 


Thr 


Asn 


ASD 

tr 


Thr 


Gly 


Ala 


Ser 


ASD 










405 










410 










415 


Asn 


Leu 


Arg 


He 


Asn 


Glu 


Val 


Ser 


Ala 


Asn 


Gin 


Lys 


Leu 


Ser 


Gly 


Ser 








420 










425 










430 




He 


Val 


Phe 
435 


Ser 


Gly 


Glu 


Arg 


Leu 
44 0 


Ser 


Thr 


Ala 


Glu 


Ala 
445 


He 


Ala 


Glu 


Asn 


Leu 


Thr 


Ser 


Arg 


He 


Asn 


Gin 


Pro 


Val 


Thr 


Leu 


Val 


Glu 


Gly 


Ser 




450 










455 










460 








Leu 


Glu 


Leu 


Lys 


Gin 


Gly 


Val 


Thr 


Leu 


He 


Thr 


Gin 


Gly 


Phe 


Ser 


Gin 


465 










470 










475 








480 


Glu 


Pro 


Glu 


Ser 


Thr 


Leu 


Leu 


Leu 


Asp 


Leu 


Gly Thr 


Ser 


Leu 


Gin 


Ala 










485 










490 










495 




Ser 


Thr 


Glu 


Asp 


He 


Val 


He 


Thr 


Asn 


Ser 


Ser 


He 


Asn 


Ala 


Asp 

zr 


Thr 








500 










505 










510 




He 


Tyr 


Gly 


Lys 


Asn 


Pro 


He 


Asn 


He 


Val 


Ala 


Ser 


Ala 


Ala 


Asn 


Lvs 






515 










520 










525 






Asn 


He 


Thr 


Leu 


Thr 


Gly 


Thr 


Leu 


Ala 


Leu 


Val 


Asn 


Ala 


Asp 


Gly 


Ala 




530 










535 










540 






Leu 


Tyr 


Glu 


Asn 


His 


Thr 


Leu 


Gin 


Asp 


Ser 


Gin 


Asp 


Tyr 


Ser 


Phe 


Val 


545 










550 










555 










560 


Lys 


Leu 


Ser 


Pro 


Gly Ala 


Gly 


Gly 


Thr 


He 


He 


Thr 


Gin 


Asp 


Ala 


Ser 










565 










570 








575 




Gin 


Lys 


Leu 


Leu 


Glu 


Val 


Ala 


Pro 


Ser 


Arg 


Pro 


His 


Tyr 


Gly 


Tyr 


Gin 








580 










585 










590 




Gly 


His 


Trp 
595 


Asn 


Val 


Gin 


Val 


He 
600 


Pro 


Gly 


Thr 


Gly 


Thr 
605 


Gin 


Pro 


Ser 


Gin 


Ala 


Asn 


Leu 


Glu 


Trp 


Val 


Arg 


Thr 


Gly Tyr 


Leu 


Pro 


Asn 


Pro 


Glu 




610 










615 










620 










Arg 


Gin 


Gly 


Phe 


Leu 


Val 


Pro 


Asn 


Ser 


Leu 


Trp 


Gly 


Ser 


Phe 


Val 


Asp 

tr 


625 










630 










635 










640 


Gin 


Arg 


Ala 


He 


Gin 


Glu 


He 


Met 


Val 


Asn 


Ser 


Ser 


Gin 


He 


Leu 


Cys 










645 










650 










655 


Gin 


Glu 


Arg 


Gly Val 


Trp 


Gly Ala 


Gly 


He 


Ala 


Asn 


Phe 


Leu 


His 


Arg 








660 










665 










670 




Asp 


Lys 


He 


Asn 


Glu 


His 


Gly 


Tyr 


Arg 


His 


Ser 


Gly 


Val 


Gly 


Tyr 


Leu 






675 










680 










685 




Val 


Gly 


Val 


Gly Thr 


His 


Ala 


Phe 


Ser 


Asp 


Ala 


Thr 


He 


Asn 


Ala 


Ala 




690 










695 










700 










Phe 


Cys 


Gin 


Leu 


Phe 


Ser 


Arg 


Asp 


Lys 


Asp 


Tyr 


Val 


Val 


Ser 


Lys 


Asn 


705 










710 










715 








720 


His 


Gly Thr 


Ser 


Tyr 


Ser 


Gly 


Val 


Val 


Phe 


Leu 


Glu 


Asp 


Thr 


Leu 


Glu 
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Phe 


Arq 


Ser 


Pro 


Gin 


Glv 


Phe 


i y it 


Thr 
X Hi. 








740 












Cys 


Asn 


Gin 


Val 


Val 


Thr 


He 


Asp 


Met 






755 










760 




Asn 


Asn 


Asp 


Met 


Lvs 


Thr 


Lys 


Tvr 
x y x. 


Thr 

X 1XX. 




770 










775 






Ser 


Trp 


Ala 


Asn 


Asp 


Val 


Phe 


Gly 


Leu 


785 










7 90 








Tvr 


Tvr 

■* 


Pro 


Asn 


Ser 


Thr 


Php 

rue 


Leu 


rne 










80 5 










Arq 


Leu 


Gin 


Cvs 


Thr 


x yi 


Ala 


nib 


Gin 








820 










one 
O Z D 


Gly 


Glu 


Val 


Arcj 


His 


Phe 


Thr 


OCX. 


ui y 






835 










o*t u 




Pro 


lie 


Glv 


Val 


Lys 


Phe 




Arg 


erne 




350 










fl ^ 

O J j 






Tvr 


Glu 


Leu 


Thr 


Leu 


Ala 


iyr 


Vdi 


Pro 


86 5 


















Lys 


Ser 


Thr 


Ala 


Thr 


Leu 


Ala 


Ser 


Gly 










885 








Asn 


Asn 


Leu 


Ser 


Arg 


Gin 


Gly 


Leu 


Gin 








900 










905 


Leu 


lie 


Asn 


Pro 


Gly 


He 


Glu 


Val 


Phe 






915 










920 




Arg 


Gly 


Ser 


Ser 


Arg 


Asn 


Tyr 


Asn 


He 




930 










935 







Phe 



945 
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73 

730 735 



Asp 


t>er 


Ser 


Ser 


Glu 
750 


Ala 


Cys 


Gin 


Leu 


Ser 


Tyr 
765 


Ser 


His 


Arg 


Thr 


Tyr 


Pro 
780 


Glu 


Ala 


Gin 


Gly 


vj -L Li 


rfie 


Gly Ala 


Thr 


Thr 


Tyr 




7 9 5 










800 


Asp 


Tyr 


Tyr 


Ser 


Pro 


Phe 


Leu 


Q 1 A 










815 




CjlU 


Asp 


Phe 


Lys 


Glu 
830 


Thr 


Gly 


Asp 


Leu 


Phe 


Asn 
845 


Leu 


Ala 


Val 


Ser 


Asp 


Cys 


Lys 


Arg 


Gly 


Ser 






860 










Asp 


va x 


He 


Arg 


Lys 


Asp 


Pro 




875 










880 


Ala 


Thr 


Trp 


Ser 


Thr 


His 


Gly 


890 










895 




Leu 


Arg 


Leu 


Gly 


Asn 
910 


His 


Cys 


Ser 


His 


Gly 


Ala 
925 


He 


Glu 


Leu 


Asn 


Leu 


Gly 


Gly 


Lys 


Tyr 


Arg 



940 



(2) INFORMATION FOR SEQ ID NO : 2 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3000 base pairs 

(B) TYPE: nucleic acid 

> (C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 259... 3000 
(D) OTHER INFORMATION: 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AT CAGGTG AT AAAAGTTCCT CGTTAGCTAG TGACTGTAGG TGACATGAGA AAGCTAACAC 
GGAGGAAACT AAAACCCAAG GAATCGAAGT CTTCATGGTA ATGCTTTTGT TTTTTAGAGA 
ACTATTCGCA TCAATATAGA AACAAAATAA GTAAATCAAG TTAAAGATGA CAAAACAGCT 
GTCAAGAATT TTTAT CTTGA CTCTCTGAGT TTTCTATTTT ATATGACGCA AGTAAGAATT 
TAATAATAAA GTGGGTTT ATG AAA TCG CAA TTT TCC TGG TTA GTG CTC TCT 

Met Lys Ser Gin Phe Ser Trp Leu Val Leu Ser 
1 5 10 
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TCG ACA TTG GCA TGT TTT ACT AGT TGT TCC ACT GTT TTT GCT GCA ACT 33 9 

Ser Thr Leu Ala Cys Phe Thr Ser Cys Ser Thr Val Phe Ala Ala Thr 
15 20 25 

GCT GAA AAT ATA GGC CCC TCT GAT AGC TTT GAC GGA AGT ACT AAC ACA 3 37 

Ala Glu Asn lie Gly Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr 
30 35 40 

GGC ACC TAT ACT CCT AAA AAT ACG ACT ACT GGA ATA GAC TAT ACT CTG 43 5 

Gly Thr Tyr Thr Pro Lys Asn Thr Thr Thr Gly He Asp Tyr Thr Leu 
45 50 55 

ACA GGA GAT ATA ACT CTG CAA AAC CTT GGG GAT TCG GCA GCT TTA ACG 4 83 

Thr Gly Asp He Thr Leu Gin Asn Leu Gly Asp Ser Ala Ala Leu Thr 
60 65 70 75 

AAG GGT TGT TTT TCT GAC ACT ACG GAA TCT TTA AGC TTT GCC GGT AAG 531 
Lys Gly Cys Phe Ser Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys 
80 35 90 

GGG TAC TCA CTT TCT TTT TTA AAT ATT AAG TCT AGT GCT GAA GGC GCA 57 9 

Gly Tyr Ser Leu Ser Phe Leu Asn He Lys Ser Ser Ala Glu Gly Ala 
95 100 105 

GCA CTT TCT GTT ACA ACT GAT AAA AAT CTG TCG CTA ACA GGA TTT TCG 62 7 

Ala Leu Ser Val Thr Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser 
HO 115 120 

AGT CTT ACT TTC TTA GCG GCC CCA TCA TCG GTA ATC ACA ACC CCC TCA 67 5 

Ser Leu Thr Phe Leu Ala Ala Pro Ser Ser Val He Thr Thr Pro Ser 
125 130 135 

GGA AAA GGT GCA GTT AAA TGT GGA GGG GAT CTT ACA TTT GAT AAC AAT 72 3 

Gly Lys Gly Ala Val Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn 
140 145 150 155 

GGA ACT ATT TTA TTT AAA CAA GAT TAC TGT GAG GAA AAT GGC GGA GCC 771 
Gly Thr He Leu Phe Lys Gin Asp Tyr Cys Glu Glu Asn Gly Gly Ala 
160 165 170 

ATT TCT ACC AAG AAT CTT TCT TTG AAA AAC AGC ACG GGA TCG ATT TCT 819 
He Ser Thr Lys Asn Leu Ser Leu Lys Asn Ser Thr Gly Ser He Ser 
175 180 185 

TTT GAA GGG AAT AAA TCG AGC GCA ACA GGG AAA AAA GGT GGG GCT ATT 86 7 

Phe Glu Gly Asn Lys Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala He 
190 195 200 

TGT GCT ACT GGT ACT GTA GAT ATT ACA AAT AAT ACG GCT CCT ACC CTC 915 
Cys Ala Thr Gly Thr Val Asp He Thr Asn Asn Thr Ala Pro Thr Leu 
205 210 215 

TTC TCG AAC AAT ATT GCT GAA GCT GCA GGT GGA GCT ATA AAT AGC ACA 96 3 

Phe Ser Asn Asn He Ala Glu Ala Ala Gly Gly Ala He Asn Ser Thr 
220 225 230 235 

GGA AAC TGT ACA ATT ACA GGG AAT ACG TCT CTT GTA TTT TCT GAA AAT 1011 
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Gly Asn Cys Thr He Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn 
240 245 250 

AGT GTG ACA GCG ACC GCA GGA AAT GGA GGA GCT CTT TCT GGA GAT GCC 1059 
Ser Val Thr Ala Thr Ala Gly Asn Gly Gly Ala Leu Ser Gly Asp Ala 
255 260 265 

GAT GTT ACC ATA TCT GGG AAT CAG AGT GTA ACT TTC TCA GGA AAC CAA 1107 
Asp Val Thr He Ser Gly Asn Gin Ser Val Thr Phe Ser Gly Asn Gin 
270 275 280 

GCT GTA GCT AAT GGC GGA GCC ATT TAT GCT AAG AAG CTT ACA CTG GCT 115 5 

Ala Val Ala Asn Gly Gly Ala He Tyr Ala Lys Lys Leu Thr Leu Ala 
285 290 295 

TCC GGG GGG GGG GGG GGT ATC TCC TTT TCT AAC AAT ATA GTC CAA GGT 12 03 

Ser Gly Gly Gly Gly Gly He Ser Phe Ser Asn Asn lie Val Gin Gly 
300 305 310 315 

ACC ACT GCA GGT AAT GGT GGA GCC ATT TCT ATA CTG GCA GCT GGA GAG 12 51 

Thr Thr Ala Gly Asn Gly Gly Ala He Ser He Leu Ala Ala Gly Glu 
32C > 325 330 

TGT AGT CTT TCA GCA GAA GCA GGG GAC ATT ACC TTC AAT GGG AAT GCC 12 9 9 

' Cys Ser Leu Ser Ala Glu Ala Gly Asp He Thr Phe Asn Gly Asn Ala 
335 340 345 

ATT GTT GCA ACT ACA CCA CAA ACT ACA AAA AGA AAT TCT ATT GAC ATA 13 4 7 

He Val Ala Thr Thr Pro Gin Thr Thr Lys Arg Asn Ser He Asp He 
350 355 360 

GGA TCT ACT GCA AAG ATC ACG AAT TTA CGT GCA ATA TCT GGG CAT AGC 13 95 

Gly Ser Thr Ala Lys He Thr Asn Leu Arg Ala He Ser Gly His Ser 
365 370 375 

ATC TTT TTC TAC GAT CCG ATT ACT GCT AAT ACG GCT GCG GAT TCT ACA 1443 
He Phe Phe Tyr Asp Pro He Thr Ala Asn Thr Ala Ala Asp Ser Thr 
380 \ 385 390 395 

GAT ACT TTA AAT CTC AAT AAG GCT GAT GCA GGT AAT AGT ACA GAT TAT 14 91 

Asp Thr Leu Asn Leu Asn Lys Ala Asp Ala Gly Asn Ser Thr Asp Tyr 
400 405 410 

AGT GGG TCG ATT GTT TTT TCT GGT GAA AAG CTC TCT GAA GAT GAA GCA 153 9 

Ser Gly Ser He Val Phe Ser Gly Glu Lys Leu Ser Glu Asp Glu Ala 
415 420 425 

AAA GTT GCA GAC AAC CTC ACT TCT ACG CTG AAG CAG CCT GTA ACT CTA 1587 
Lys Val Ala Asp Asn Leu Thr Ser Thr Leu Lys Gin Pro Val Thr Leu 
430 435 440 

ACT GCA GGA AAT TTA GTA CTT AAA CGT GGT GTC ACT CTC GAT ACG AAA 163 5 

Thr Ala Gly Asn Leu Val Leu Lys Arg Gly Val Thr Leu Asp Thr Lys 
445 450 455 

GGC TTT ACT CAG ACC GCG GGT TCC TCT GTT ATT ATG GAT GCG GGC ACA 16 83 

Gly Phe Thr Gin Thr Ala Gly Ser Ser Val He Met Asp Ala Gly Thr 
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460 465 470 



475 



ACG TTA AAA GCA AGT ACA GAG GAG GTC ACT TTA ACA GGT CTT TCC ATT 1731 
Thr Leu Lys Ala Ser Thr Glu Glu Val Thr Leu Thr Gly Leu Ser He 
480 485 490 

CCT GTA GAC TCT TTA GGC GAG GGT AAG AAA GTT GTA ATT GCT GCT TCT 177 9 

Pro Val Asp Ser Leu Gly Glu Gly Lys Lys Val Val He Ala Ala Ser 
495 500 505 

GCA GCA AGT AAA AAT GTA GCC CTT AGT GGT CCG ATT CTT CTT TTG GAT 182 7 

Ala Ala Ser Lys Asn Val Ala Leu Ser Gly Pro He Leu Leu Leu Asp 
510 515 520 

AAC CAA GGG AAT GCT TAT GAA AAT CAC GAC TTA GGA AAA ACT CAA GAC 187 5 

Asn Gin Gly Asn Ala Tyr Glu Asn His Asp Leu Gly Lys Thr Gin Asp 
525 530 535 

TTT TCA TTT GTG CAG CTC TCT GCT CTG GGT ACT GCA ACA ACT ACA GAT 192 3 

Phe Ser Phe Val Gin Leu Ser Ala Leu Gly Thr Ala Thr Thr Thr Asp 
540 545 550 555 

GTT CCA GCG GTT CCT ACA GTA GCA ACT CCT ACG CAC TAT GGG TAT CAA 1971 
Val Pro Ala Val Pro Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gin 
560 565 570 

GGT ACT TGG GGA ATG ACT TGG GTT GAT GAT ACC GCA AGC ACT CCA AAG 2 019 

Gly Thr Trp Gly Met Thr Trp Val Asp Asp Thr Ala Ser Thr Pro Lys 
575 580 585 

ACT AAG ACA GCG ACA TTA GCT TGG ACC AAT ACA GGC TAC CTT CCG AAT 2 06 7 

Thr Lys Thr Ala Thr Leu Ala Trp Thr Asn Thr Gly Tyr Leu Pro Asn 
59 0 595 600 

CCT GAG CGT CAA GGA CCT TTA GTT CCT AAT AGC CTT TGG GGA TCT TTT 2115 
Pro Glu Arg Gin Gly Pro Leu Val Pro Asn Ser Leu Trp Gly Ser Phe 
605 610 615 

TCA GAC ATC CAA GCG ATT CAA GGT GTC ATA GAG AGA AGT GCT TTG ACT 216 3 

Ser Asp He Gin Ala He Gin Gly Val He Glu Arg Ser Ala Leu Thr 
620 625 630 635 

CTT TGT TCA GAT CGA GGC TTC TGG GCT GCG GGA GTC GCC AAT TTC TTA 2211 
Leu Cys Ser Asp Arg Gly Phe Trp Ala Ala Gly Val Ala Asn Phe Leu 
64 0 645 650 

GAT AAA GAT AAG AAA GGG GAA AAA CGC AAA TAC CGT CAT AAA TCT GGT 22 5 9 

Asp Lys Asp Lys Lys Gly Glu Lys Arg Lys Tyr Arg His Lys Ser Gly 
655 660 665 

GGA TAT GCT ATC GGA GGT GCA GCG CAA ACT TGT TCT GAA AAC TTA ATT 23 0 7 

Gly Tyr Ala He Gly Gly Ala Ala Gin Thr Cys Ser Glu Asn Leu He 
67 0 675 680 

AGC TTT GCC TTT TGC CAA CTC TTT GGT AGC GAT AAA GAT TTC TTA GTC 23 5 5 

Ser Phe Ala Phe Cys Gin Leu Phe Gly Ser Asp Lys Asp Phe Leu Val 
685 690 695 
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GCT AAA AAT CAT ACT GAT ACC TAT GCA GGA GCC TTC TAT ATC CAA CAC 2 40 3 

Ala Lys Asn His Thr Asp Thr Tyr Ala Gly Ala Phe Tyr He Gin His 

700 705 710 715 

ATT ACA GAA TGT AGT GGG TTC ATA GGT TGT CTC TTA GAT AAA CTT CCT 2 451 

He Thr Glu Cys Ser Gly Phe He Gly Cys Leu Leu Asp Lys Leu Pro 
7 20 725 730 

GGC TCT TGG AGT CAT AAA CCC CTC GTT TTA GAA GGG CAG CTC GCT TAT 24 9 9 

Gly Ser Trp Ser His Lys Pro Leu Val Leu Glu Gly Gin Leu Ala Tyr 
735 740 745 

AGC CAC GTC AGT AAT GAT CTG AAG ACA AAG TAT ACT GCG TAT CCT GAG 2 54 7 

Ser His Val Ser Asn Asp Leu Lys Thr Lys Tyr Thr Ala Tyr Pro Glu 
750 755 760 

GTG AAA GGT TCT TGG GGG AAT AAT GCT TTT AAC ATG ATG TTG GGA GCT 2 5 95 

Val Lys Gly Ser Trp Gly Asn Asn Ala Phe Asn Met Met Leu Gly Ala 
765 770 775 

TCT TCT CAT TCT TAT CCT GAA TAC CTG CAT TGT TTT GAT ACC TAT GCT 2 64 3 

Ser Ser His Ser Tyr Pro Glu Tyr Leu His Cys Phe Asp Thr Tyr Ala 

780 785 790 795 

CCA TAC ATC AAA CTG AAT CTG ACC TAT ATA CGT CAG GAC AGC TTC TCG 2 6 91 

Pro Tyr He Lys Leu Asn Leu Thr Tyr He Arg Gin Asp Ser Phe Ser 
800 805 810 

GAG AAA GGT ACA GAA GGA AGA TCT TTT GAT GAC AGC AAC CTC TTC AAT 2 73 9 

Glu Lys Gly Thr Glu Gly Arg Ser Phe Asp Asp Ser Asn Leu Phe Asn 
815 820 825 

TTA TCT TTG CCT ATA GGG GTG AAG TTT GAG AAG TTC TCT GAT TGT AAT 2 787 

Leu Ser Leu Pro He Gly Val Lys Phe Glu Lys Phe Ser Asp Cys Asn 
830 835 840 

GAC TTT TCT TAT GAT CTG ACT TTA TCC TAT GTT CCT GAT CTT ATC CGC 2 83 5 

Asp Phe Ser Tyr Asp Leu Thr Leu Ser Tyr Val Pro Asd Leu He Arg 
845 850 855 

AAT GAT CCC AAA TGC ACT ACA GCA CTT GTA ATC AGC GGA GCC TCT TGG 2 883 

Asn Asp Pro Lys Cys Thr Thr Ala Leu Val He Ser Gly Ala Ser Trp 

860 865 870 875 

GAA ACT TAT GCC AAT AAC TTA GCA CGA CAG GCC TTG CAA GTG CGT GCA 2 931 

Glu Thr Tyr Ala Asn Asn Leu Ala Arg Gin Ala Leu Gin Val Arg Ala 
880 885 890 

GGC AGT CAC TAC GCC TTC TCT CCT ATG TTT GAA GTG CTC GGC CAG TTT 2 97 9 

Gly Ser His Tyr Ala Phe Ser Pro Met Phe Glu Val Leu Gly Gin Phe 
895 900 905 



GTC TTT GAA GTT CGT GGA TCC 
Val Phe Glu Val Arg Gly Ser 
910 



3000 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 914 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Lys Ser Gin Phe Ser Trp Leu Val Leu Ser Ser Thr Leu Ala Cys 

15 10 15 

Phe Thr Ser Cys Ser Thr Val Phe Ala Ala Thr Ala Glu Asn lie Gly 

20 25 30 

Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr Gly Thr Tyr Thr Pro 

35 40 45 

Lys Asn Thr Thr Thr Gly He Asp Tyr Thr Leu Thr Gly Asp He Thr 

50 55 60 

Leu Gin Asn Leu Gly Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser 
65 70 75 80 

Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser 

85 90 95 

Phe Leu Asn lie Lys Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr 

100 105 no 

Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu 

115 120 125 

Ala Ala Pro Ser Ser Val He Thr Thr Pro Ser Gly Lys Gly Ala Val 

130 1 3 5 140 

Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn Gly Thr He Leu Phe 
145 150 155 160 

Lys Gin Asp Tyr Cys Glu Glu Asn Gly Gly Ala He Ser Thr Lys Asn 

165 170 175 

Leu Ser Leu Lys Asn Ser Thr Gly Ser He Ser Phe Glu Gly Asn Lys 

180 185 190 

Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala He Cys Ala Thr Gly Thr 

195 200 205 

Val Asp He Thr Asn Asn Thr Ala Pro Thr Leu Phe Ser Asn Asn He 

210 215 220 

Ala Glu Ala Ala Gly Gly Ala He Asn Ser Thr Gly Asn Cys Thr He 

It 5 , 230 235 240 

Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn Ser Val Thr Ala Thr 

245 250 255 

Ala Gly Asn Gly Gly Ala Leu Ser Gly Asp Ala Asp Val Thr He Ser 

260 265 270 

Gly Asn Gin Ser Val Thr Phe Ser Gly Asn Gin Ala Val Ala Asn Gly 

275 280 285 

Gly Ala He Tyr Ala Lys Lys Leu Thr Leu Ala Ser Gly Gly Gly Gly 

290 295 300 

Gly He Ser Phe Ser Asn Asn He Val Gin Gly Thr Thr Ala Gly Asn 

305 -310 315 320 

Gly Gly Ala He Ser He Leu Ala Ala Gly Glu Cys Ser Leu Ser Ala 

32 5 330 335 

Glu Ala Gly Asp He Thr Phe Asn Gly Asn Ala He Val Ala Thr Thr 

340 345 350 
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Thr 


Thr 


Lys 


Arg 


Asn 


Ser 


He 


Asp 






355 










360 




He 


Thr 


Asn 


Leu 


Arg 


Ala 


He 


Ser 


Gly 


His 




370 










375 






Pro 


He 


Thr 


Ala 


Asn 


Thr 


Ala 


Ala 


Asp 


Ser 


385 










390 








Asn 


Lys 


Ala 


Asp 


Ala 


Gly 


Asn 


Ser 


Thr 


Asp 










405 
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Phe 


Ser 


Gly 


Glu 


Lys 


Leu 


Ser 


Glu 


Asp 


Glu 








420 










425 




Leu 


Thr 


Ser 


Thr 


Leu 


Lys 


Gin 


Pro 


Val 


Thr 
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440 






Val 


Leu 


Lys 


Arg 


Gly Val 


Thr 


Leu 


Asp 


Thr 




450 










455 






Ala 


Gly 


Ser 


Ser 


Val 


He 


Met 


Asp 


Ala 


Gly 

7 


465 










4 7 0 








Thr 


Glu 


Glu 


Val 


Thr 


Leu 


Thr 


Gly 


Leu 


Ser 










485 










4 90 


Gly 


Glu 


Gly 


Lys 


Lys 


Val 


Val 


He 


Ala 


Ala 








500 










505 




Val 


Ala 


Leu 


Ser 


Glv 


Pro 


He 


Leu 


Leu 


Leu 






515 










520 






Tyr 


Glu 


Asn 


His 


Asp 


Leu 


Gly 


Lys 


Thr 


Gin 




530 










535 








Leu 


Ser 


Ala 


Leu 


Gly 


Thr 


Ala 


Thr 


Thr 


Thr 


545 










,550 










Thr 


Val 
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Thr 
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Thr 


His 
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Gly Tyr 
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Val 
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Asp 


Thr 


Ala 


Ser 


Thr 


Pro 
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Trp 


Thr 
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Gly 


Tyr 


Leu 


Pro 
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Leu 


Val 


Pro 


Asn 


Ser 


Leu 


Trp 


Gly 


Ser 
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lie 


Gin 


Gly 


Val 


He 


Glu 
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Ser 


Ala 


Leu 
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Gly 


Phe 


Trp 
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Val 
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Arg 


Lys 


Tyr 


Arg 
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Lys 
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Gly 
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Gin 
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Ser 


Glu 
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Leu 
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46 0 
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Val 
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605 








Phe 


Ser 


Asp 


He 


bin 


Ala 




620 










Thr 


Leu 


Cys 


Ser 


As p 


Arg 


635 










o 4 U 


Leu 


Asp 


Lys 


Asp 


Lys 


Lys 










a c c 
bob 




Gly 


Gly Tyr Ala 


lie 


bly 








670 






He 


Ser 


Phe 


Ala 


t*ne 


Cys 






685 








Val 


Ala 


Lys 


Asn 


nlS 


l nr 




700 










His 


He 


Thr 


Glu 


cys 


Ser 


715 










Tift 


Pro 


Gly 


Ser 


Trp 


C? a -v 

ber 


His 
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Ser 
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750 
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Lys 


Gly 


Ser 


Trp 
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Ala 


Ser 


Ser 


His 


Ser 
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He 


Lys 


Leu 
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Glu 


Lys 


Gly 


Thr 


Glu 
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805 810 815 

Gly Arg Ser Phe Asp Asp Ser Asn Leu Phe Asn Leu Ser Leu Pro He 

820 825 830 

Gly Val Lys Phe Glu Lys Phe Ser Asp Cys Asn Asp Phe Ser Tyr Asp 

835 840 845 

Leu Thr Leu Ser Tyr Val Pro Asp Leu He Arg Asn Asp Pro Lys Cys 

850 855 860 

Thr Thr Ala Leu Val He Ser Gly Ala Ser Trp Glu Thr Tyr Ala Asn 
865 870 875 880 

Asn Leu Ala Arg Gin Ala Leu Gin Val Arg Ala Gly Ser His Tyr Ala 

885 890 895 

Phe Ser Pro Met Phe Glu Val Leu Gly Gin Phe Val Phe Glu Val Arg 
900 905 910 

Gly Ser 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1...1200 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GAT CCT AAA AAT AAA GAG TAC ACA GGG ACC ATA CTC TTT TCT GGA GAA 

Asp Pro Lys Asn Lys Glu Tyr Thr Gly Thr lie Leu Phe Ser Gly Glu 

1 5 10 15 

AAG AGT CTA GCA AAC GAT CCT AGG GAT TTT AAA TCT ACA ATC CCT CAG 

Lys Ser Leu Ala Asn Asp Pro Arg Asp Phe Lys Ser Thr He Pro Gin 

20 25 30 



48 



96 



AAC GTC AAC CTG TCT GCA GGA TAC TTA GTT ATT AAA GAG GGG GCC GAA 144 
Asn Val Asn Leu Ser Ala Gly Tyr Leu Val He Lys Glu Gly Ala Glu 
35 40 45 

GTC ACA GTT TCA AAA TTC ACG CAG TCT CCA GGA TCG CAT TTA GTT TTA 192 
Val Thr Val Ser Lys Phe Thr Gin Ser Pro Gly Ser His Leu Val Leu 
50 55 60 

GAT TTA GGA ACC AAA CTG ATA GCC TCT AAG GAA GAC ATT GCC ATC ACA 240 
Asp Leu Gly Thr Lys Leu He Ala Ser Lys Glu Asp He Ala He Thr 
65 7 ° 75 80 

GGC CTC GCG ATA GAT ATA GAT AGC TTA AGC TCA TCC TCA ACA GCA GCT 2 88 

Gly Leu Ala He Asp He Asp Ser Leu Ser Ser Ser Ser Thr Ala Ala 
85 90 95 
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GTT ATT AAA GCA AAC ACC GCA AAT AAA CAG ATA TCC GTG ACG GAC TCT 33 6 

Val He Lys Ala Asn Thr Ala Asn Lys Gin He Ser Val Thr Aso Ser 
100 105 110 

ATA GAA CTT ATC TCG CCT ACT GGC AAT GCC TAT GAA GAT CTC AGA ATG 3 84 

He Glu Leu He Ser Pro Thr Gly Asn Ala Tyr Glu Asp Leu Arg Met 
115 120 125 

AGA AAT TCA CAG ACG TTC CCT CTG CTC TCT TTA GAG CCT GGA GCC GGG 43 2 

Arg Asn Ser Gin Thr Phe Pro Leu Leu Ser Leu Glu Pro Gly Ala Gly 
130 135 140 

GGT AGT GTG ACT GTA ACT GCT GGA GAT TTC CTA CCG GTA AGT CCC CAT 4 80 

Gly Ser Val Thr Val Thr Ala Gly Asp Phe Leu Pro Val Ser Pro His 
145 150 155 160 

TAT GGT TTT CAA GGC AAT TGG AAA TTA GCT TGG ACA GGA ACT GGA AAC 52 8 

Tyr Gly Phe Gin Gly Asn Trp Lys Leu Ala Trp Thr Gly Thr Gly Asn 
165 170 175 

AAA GTT GGA GAA TTC TTC TGG GAT AAA ATA AAT TAT AAG CCT AGA CCT 57 6 

Lys Val Gly Glu Phe Phe Trp Asp Lys He Asn Tyr Lys Pro Arg Pro 
180 185 190 

GAA AAA GAA GGA AAT TTA GTT CCT AAT ATC TTG TGG GGG AAT GCT GTA 62 4 

Glu Lys Glu Gly Asn Leu Val Pro Asn He Leu Trp Gly Asn Ala Val 
195 200 205 

AAT GTC AGA TCC TTA ATG CAG GTT CAA GAG ACC CAT GCA TCG AGC TTA 672 
Asn Val Arg Ser Leu Met Gin Val Gin Glu Thr His Ala Ser Ser Leu 
210 215 220 

CAG ACA GAT CGA GGG CTG TGG ATC GAT GGA ATT GGG AAT TTC TTC CAT 72 0 

Gin Thr Asp Arg Gly Leu Trp He Asp Gly He Gly Asn Phe Phe His 
225 230 235 240 

GTA TCT GCC TCC GAA GAC AAT ATA AGG TAC CGT CAT AAC AGC GGT GGA 76 8 

Val Ser Ala Ser Glu Asp Asn He Arg Tyr Arg His Asn Ser Gly Gly 
245 250 255 

TAT GTT CTA TCT GTA AAT AAT GAG ATC ACA CCT AAG CAC TAT ACT TCG 816 
Tyr Val Leu Ser Val Asn Asn Glu He Thr Pro Lys His Tyr Thr Ser 
260 265 270 

ATG GCA TTT TCC CAA CTC TTT AGT AGA GAC AAA GAC TAT GCG GTT TCC 864 
Met Ala Phe Ser Gin Leu Phe Ser Arg Asp Lys Asp Tyr Ala Val Ser 
275 280 285 

AAC AAC GAA TAC AGA ATG TAT TTA GGA TCG TAT CTC TAT CAA TAT ACA 912 
Asn Asn Glu Tyr Arg Met Tyr Leu Gly Ser Tyr Leu Tyr Gin Tyr Thr 
290 295 300 

ACC TCC CTA GGG AAT ATT TTC CGT TAT GCT TCG CGT AAC CCT AAT GTA 960 
Thr Ser Leu Gly Asn He Phe Arg Tyr Ala Ser Arg Asn Pro Asn Val 
305 310 . 315 320 

AAC GTC GGG ATT CTC TCA AGA AGG TTT CTT CAA AAT CCT CTT ATG ATT 10 0 3 
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Asn val Gly lie Leu 5er Arg Arg Phe Leu Gin Asn Pro Leu Met lie 
325 330 335 

TTT CAT TTT TTG TGT GCT TAT GGT CAT GCC ACC AAT GAT ATG AAA ACA 10 5 6 

Phe His Phe Leu Cys Ala Tyr Gly His Ala Thr Asn Asp Met Lys Thr 
340 345 350 

GAC TAC GCA AAT TTC CCT ATG GTG AAA AAC AGC TGG AGA AAC AAT TGT 1104 

Asp Tyr Ala Asn Phe Pro Met Val Lys Asn Ser Trp Arg Asn Asn Cys 
355 360 365 

TGG GCT ATA AAA TGC GGA GGG AGC ATG CCT CTA TTG GTA TTT GAA AAC 1152 

Trp Ala lie Lys Cys Gly Gly Ser Met Pro Leu Leu Val Phe Glu Asn 
370 375 380 

GGA AAA CTT TTC CAA GGT GCC ATC CCA TTT ATG AAA CTA CAA TTA GTT 12 0 0 

Gly Lys Leu Phe Gin Gly Ala He Pro Phe Met Lys Leu Gin Leu Val 

335 390 395 400 



(2) INFORMATION FOR SEQ ID NO : 2 8 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 400 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Asp Pro Lys Asn Lys Glu Tyr Thr Gly Thr He Leu Phe Ser Gly Glu 

1 -5 10 - 15 

Lys Ser Leu Ala Asn Asp Pro Arg Asp Phe Lys Ser Thr He Pro Gin 

20 25 30 

Asn Val Asn Leu Ser Ala Gly Tyr Leu Val He Lys Glu Gly Ala Glu 

35 40 45 

Val Thr Val Ser Lys Phe Thr Gin Ser Pro Gly Ser His Leu Val L-u 

50 55 . 60 

Asp Leu Gly Thr Lys Leu He Ala Ser Lys Glu Asp He Ala He Thr 
65 ? 0 75 80 

Gly Leu Ala He Asp He Asp Ser Leu Ser Ser Ser Ser Thr Ala Ala 

85 90 95 

Val He Lys Ala Asn Thr Ala Asn Lys Gin He Ser Val Thr Asp Ser 

100 105 110 

He Glu Leu He Ser Pro Thr Gly Asn Ala Tyr Glu Asp Leu Arg Met 

115 120 125 

Arg Asn Ser Gin Thr Phe Pro Leu Leu Ser Leu Glu Pro Gly Ala Gly 

130 135 140 

Gly Ser Val Thr Val Thr Ala Gly Asp Phe Leu Pro Val Ser Pro His 
145 150 155 160 

Tyr Gly Phe Gin Gly Asn Trp Lys Leu Ala Trp Thr Gly Thr Gly Asn 

165 170 175 

Lys Val Gly Glu Phe Phe Trp Asp Lys He Asn Tyr Lys Pro Arg Pro 
180 185 * 190 
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He 
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240 


Val 


Ser 
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Glu 


Asp 


Asn 


He 
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He 


Thr 


Pro 


Lys 


His 
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i y -L 


Thr 


Ser 








260 










2 65 










9 *7 n 






Met 


Ala 


Phe 


Ser 


Gin 


Leu 


Phe 


Ser 


Arcr 


Asp 


Lys 


Asp 


l yi 




Val 


Ser 






275 










280 








2 8 5 








Asn 


Asn 


Glu 


Tyr 


Arg 


Met 


Tvr 


Leu 


Gly 


Ser 


±yr 


Leu 


Tyr 


uin 


Tyr 


Thr 




290 










295 










3 00 








Thr 


Ser 


Leu 


Gly 


Asn 


He 


Phe 


Arg 




Ala 


O CJ. 


Arg 


Asn 


Pro 


Asn 


Val 


305 


Val 








310 










J 1 J 










320 


Asn 


Gly 


He 


Leu 


Ser 


Arg 


Arg 


rue 


Leu 


uin 


Asn 


fro 


Leu 


Met 


He 


Phe 








325 










_j o u 










335 




His 


Phe 


Leu 


Cys 


Ala 


Tyr 


Gly 


His 


Ala 


Thr 


As n 


As D 


Met 


Lys 


Thr 








340 










345 










350 




Asp 


Tyr 


Ala 


Asn 


Phe 


Pro 


Met 


Val 


Lys 


Asn 


Ser 


Trp 


Arg 


Asn 


Asn 


Cys 






355 










360 










365 






Trp 


Ala 


He 


Lys 


Cys 


Gly 


Gly 


Ser 


Met 


Pro 


Leu 


Leu 


Val 


Phe 


Glu 


Asn 




370 










375 










380 










Gly 


Lys 


Leu 


Phe 


Gin 


Gly 


Ala 


He 


Pro 


Phe 


Met 


Lys 


Leu 


Gin 


Leu 


Val 


385 










390 










395 








400 



(2) INFORMATION FOR SEQ ID NO : 2 9 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1...1830 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 

GAT CTC ACA TTA GGG AGT CGT GAC AGT TAT AAT GGT GAT ACA AGC ACC 4 8 

Asp Leu Thr Leu Gly Ser Arg Asp Ser Tyr Asn Gly Asp Thr Ser Thr 
15 10 15 

ACA GAA TTT ACT CCT AAA GCG GCA ACT TCT GAT GCT AGT GGC ACG ACC 9 6 

Thr Glu Phe Thr Pro Lys Ala Ala Thr Ser Asp Ala Ser Gly Thr Thr 
20 25 30 

TAT ATT CTC GAT GGG GAT GTC TCG ATA AGC CAA GCA GGG AAA CAA ACG 14 4 

Tyr He Leu Asp Gly Asp Val Ser He Ser Gin Ala Gly Lys Gin Thr 
35 40 45 
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AGC TTA ACC ACA AGT TGT TTT TCT AAC ACT GCA GGA AAT CTT ACC TTC 192 
Ser Leu Thr Thr Ser Cys Phe Ser Asn Thr Ala Gly Asn Leu Thr Phe 
50 55 60 



TTA GGG AAC GGA TTT TCT CTT CAT TTT GAC AAT ATT ATT TCG TCT ACT 
Leu Gly Asn Gly Phe Ser Leu His Phe Asp Asn lie He Ser Ser Thr 
65 70 75 80 



GCT CTC AAT ATT AAT AGC CCT GAT ACT GGA GAT AAC AAA GAG TAT ACG 
Ala Leu Asn He Asn Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr 
145 150 155 160 



240 



GTT GCA GGT GTT GTT GTT AGC AAT ACA GCA GCT TCT GGG ATT ACG AAA 288 

Val Ala Gly Val Val Val Ser Asn Thr Ala Ala Ser Gly He Thr Lys 
85 90 95 

TTC TCA GGA TTT TCA ACT CTT CGG ATG CTT GCA GCT CCT AGG ACC ACA 33 6 

Phe Ser Gly Phe Ser Thr Leu Arg Met Leu Ala Ala Pro Arg Thr Thr 
100 105 110 

GGT AAA GGA GCC ATT AAA ATT ACC GAT GGT CTG GTG TTT GAG AGT ATA 3 84 

Gly Lys Gly Ala He Lys He Thr Asp Gly Leu Val Phe Glu Ser He 

115 120 125 

GGG AAT CTT GAT CCG ATT ACT GTA ACA GGA TCG ACA TCT GTT GCT GAT 4 32 

Gly Asn Leu Asp Pro He Thr Val Thr Gly Ser Thr Ser Val Ala Asp 
130 135 140 



480 



GGA ACC ATA GTC TTT TCT GGA GAG AAG CTC ACG GAG GCA GAA GOT AAA 52 8 

Gly Thr He Val Phe Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys 
!65 170 175 

GAT GAG AAG AAC CGC ACT TCT AAA TTA CTT CAA AAT GTT GCT TTT AAA 5 76 

Asp Glu Lys Asn Arg Thr Ser Lys Leu Leu Gin Asn Val Ala Phe Lys 
180 ' 185 190 

AAT GGG ACT GTA GTT TTA AAA GGT GAT GTC GTT TTA AGT GCG AAC GGT 62 4 

Asn Gly Thr Val Val Leu Lys Gly Asp Val Val Leu Ser Ala Asn Gly 
195 200 205 

TTC TCT CAG GAT GCA AAC TCT AAG TTG ATT ATG GAT TTA GGG ACG TCG 6 72 

Phe Ser Gin Asp Ala Asn Ser Lys Leu He Met Asp Leu Gly Thr Ser 
210 215 220 

TTG GTT GCA AAC ACC GAA AGT ATC GAG TTA ACG AAT TTG GAA ATT AAT 72 0 

Leu Val Ala Asn Thr Glu Ser He Glu Leu Thr Asn Leu Glu He Asn 
225 230 235 240 

ATA GAC TCT CTC AGG AAC GGG AAA AAG ATA AAA CTC AGT GCT GCC ACA 76 8 

He Asp Ser Leu Arg Asn Gly Lys Lys He Lys Leu Ser Ala Ala Thr 
245 250 255 

GCT CAG AAA GAT ATT CGT ATA GAT CGT CCT GTT GTA CTG GCA ATT AGC 316 
Ala Gin Lys Asp He Arg He Asp Arg Pro Val Val Leu Ala He Ser 
260 265 270 



GAT GAG AGT TTT TAT CAA AAT GGC TTT TTG AAT GAG GAC CAT TCC TAT 



864 
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Asp Glu Ser 
275 

GAT GGG ATT 
Asp Gly lie 
290 

GAT TCT CGC 
Asp Ser Arg 
305 

AAG TGG ACG 
Lys Trp Thr 



TGG GCG AAG 

Trp Ala Lys 



GTT CCT AAT 
Val Pro Asn 
355 

AAT TTT ATA 
Asn Phe lie 
370 

TGG GTT GCA 
Trp Val Ala 
385 

CAA AGG AAA 
Gin Arg Lys 



ACG AGG ATG 
Thr Arg Met 



TTT GCG CGT 
Phe Ala Arg 
435 

TAC GCA GGA 
Tyr Ala Gly 
450 

GTG AGT ATC 
Val Ser lie 
465 

TAT GTT TCC 
Tyr Val Ser 



GGC CAT ACG 
Gly His Thr 



Phe Tyr Gin 



CTT GAG TTA 
Leu Glu Leu 



AGT ATA GAT 
Ser lie Asp 
310 

ATC AAT TGG 
lie Asn Trp 
325 

CAG AGT TTT 
Gin Ser Phe 
340 

CTT CTT TGG 
Leu Leu Trp 

GAG CTA GGT 
Glu Leu Gly 



GGC ATT TCC 
Gly lie Ser 
390 

TTC CGT CAT 
Phe Arg His 
405 

CCG GGT GGT 
Pro Gly Gly 
420 

GAC AAA GAC 
Asp Lys Asp 



TCT TTA CGT 
Ser Leu Arg 



CTT TTA GGA 
Leu Leu Gly 
470 

AAT ACT CTG 
Asn Thr Leu 
485 

GAT CAT CGC 
Asp His Arg 



Asn Gly Phe 
280 

GAT GCT GGG 
Asp Ala Gly 
295 

GCT GTA CAA 
Ala Val Gin 



TCT ACT GAT 
Ser Thr Asp 



AAT CCC ACT 

Asn Pro Thr 
345 

GGT TCT TTT 
Gly Ser Phe 
360 

ACT GAA GGT 
Thr Glu Gly 
375 

AAT GTT TTG 
Asn Val Leu 



GTG AGT GGA 
Val Ser Gly 



GAT ACC TTG 
Asp Thr Leu 
425 

TAC TTT ATG 
Tyr Phe. Met 
440 

TTG CAG CAC 
Leu Gin His 
455 

GAG GGA GGA 
Glu Gly Gly 



CCG TGC TCT 
Pro Cys Ser 



ATG AAG ACC 
Met Lys Thr 



Leu Asn Glu 



AAA GAC ATC 
Lys Asp lie 
300 

TCT CCG TAT 
Ser Pro Tyr 
315 

GAT AAG AAA 
Asp Lys Lys 
330 

GCT GAG CAG 
Ala Glu Gin 



ATA GAT GTT 
lie Asp Val 



GCT CCT TAC 
Ala Pro Tyr 
380 

CAT AGG AGC 
His Arg Ser 
395 

GGT GCT GTA 
Gly Ala Val 
410 

TCT CTG GGT 
Ser Leu Gly 



AAT ACC AAT 
Asn Thr Asn 



GAT GCT TCC 
Asp Ala Ser 
460 

CTC CGC GAG 
Leu Arg Glu 
475 

TTC TAT GGG 
Phe Tyr Gly 
490 

GAG TCT CTA 
Glu Ser Leu 



Asp His Ser 
285 

GTG ATT TCT 
Val lie Ser 



GGC TAT CAG 
Gly Tyr Gin 



GCT ACG GTT 
Ala Thr Val 
335 

GAG GCT CCG 
Glu Ala Pro 
350 

CGT TCC TTC 
Arg Ser Phe 
365 

GAA AAG AGA 
Glu Lys Arg 



GGT CGT GAA 
Gly Arg Glu 



GTA GGT GCT 
Val Gly Ala 
415 

TTT GCT CAG 
Phe Ala Gin 
430 

TTC GCA AAG 
Phe Ala Lys 
445 

CTA TAC TCT 
Leu Tyr Ser 

ATC CTG TTG 
lie Leu Leu 



CAG CTT AGC 
Gin Leu Ser 
495 

CCC CCC CCC 
Pro Pro Pro 



Tyr 



GCA 912 
Ala 



GGA 96 0 

Gly 

320 

TCT 1008 
Ser 



TTA 1056 
Leu 



CAG 110 4 

Gin 



TTT ~ 1152 
Phe 



AAT 1200 

Asn 

400 

AGC 124 8 

Ser 



CTC 12 96 

Leu 



ACC 1344 
Thr 



GTG 13 92 

Val 



CCT 1440 

Pro 

480 

TAC 14 88 

Tyr 



CCC 153 6 

Pro 
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CCG ACG CTC 
Pro Thr Leu 
515 

GGA GAG CTG 
Gly Glu Leu 
530 

TTT TTC CGA 
Phe Phe Arg 
545 

CGC CAA GAT 
Arg Gin Asp 

GAT TCG CAT 
Asp Ser His 

AAA CGG TTT 
Lys Arg Phe 
595 

GAT GTT 
Asp Val 
610 



500 

TCG ACG GAT 
Ser Thr Asp 



GGA ACT CGA 
Gly Thr Arg 



GAG TAC ACT 
Glu Tyr Thr 
550 

AGC TTT GTT 
Ser Phe Val 
565 

CTT TAT AAC 
Leu Tyr Asn 
580 

GCA GAG CAA 
Ala Glu Gin 



505 

CAT ACT TCT 
His Thr Ser 
520 

GTT GCT GTT 
Val Ala Val 
535 

CCA TTT GTA 
Pro Phe Val 



GAA CTA GGA 
Glu Leu Gly 

CTT GCG ATT 
Leu Ala lie 
585 

TAT TAT CAT 
Tyr Tyr His 
600 



86 



TGG GGA GGA 
Trp Gly Gly 



GAA AAT ACC 
Glu Asn Thr 
540 

AAA GTC CAA 
Lys Val Gin 
555 

GCT ATC AGT 
Ala He Ser 
570 

CCT CTT GGA 
Pro Leu Gly 



GTT GTT GCG 
Val Val Ala 



510 

TAT GTC TGG 
Tyr Val Trp 
525 

AGC GGC AGA 
Ser Gly Arg 



GCT GTT TAC 
Ala Val Tyr 

CGT GAT TTT 
Arg Asp Phe 
575 

ATC AAG TTA 
lie Lys Leu 
590 

ATG TAT TCT 
Met Tyr Ser 
605 



GCT 15 84 

Ala 



GGA 163 2 
Gly 



TCG 1680 

Ser 

560 

AGT 172 8 

Ser 



GAG 1776 
Glu 



CCA 1824 
Pro 



1830 



(2) INFORMATION FOR SEQ ID NO : 3 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Asp 


Leu 


Thr 


Leu 


Gly 


Ser 


Arg 


Asp 


Ser 


Tyr 


Asn 


Gly 


Asp 


Thr 


Ser 


Thr 


1 








5 










10 










15 




Thr 


Glu 


Phe 


Thr 


Pro 


Lys 


Ala 


Ala 


Thr 


Ser 


Asp 


Ala 


Ser 


Gly 


Thr 


Thr 








20 










25 










30 






Tyr 


He 


Leu 


Asp 


Gly 


Asp 


Val 


Ser 


He 


Ser 


Gin 


Ala 


Gly 


Lys 


Gin 


Thr 






35 










40 










45 






Ser 


Leu 


Thr 


Thr 


Ser 


Cys 


Phe 


Ser 


Asn 


Thr 


Ala 


Gly 


Asn 


Leu 


Thr 


Phe 




50 










55 










60 










Leu 


Gly 


Asn 


Gly 


Phe 


Ser 


Leu 


His 


Phe 


Asp 


Asn 


He 


He 


Ser 


Ser 


Thr 


65 










70 










75 










80 


val 


Ala 


Gly 


Val 


Val 


Val 


Ser 


Asn 


Thr 


Ala 


Ala 


Ser 


Gly 


He 


Thr 


Lys 










35 










90 










95 


Phe 


Ser 


Gly 


Phe 


Ser 


Thr 


Leu 


Arg 


Met 


Leu 


Ala 


Ala 


Pro 


Arg 


Thr 


Thr 
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100 










105 










110 






Gly 


Lys 


Gly Ala 


He 


Lys 


He 


Thr 


Asp 


Glv 


Leu 


v ex X 


rflc 


ulu 


ber 


He 






115 










120 










X ^ J 






Gly 


Asn 


Leu 


Asp 


Pro 


He 


Thr 


Val 


Thr 


Glv 


*3 CI 


1 XIX 


Cor 
OCI 


vai 


Ala 


Asp 




130 










135 










14 0 








Ala 


Leu 


Asn 


He 


Asn 


Ser 


Pro 


Asp 


Thr 


Gly 




Asn 


.uys 


p 1 1 1 

bill 


Tyr 


Thr 


145 










150 










155 








lbU 


Gly 


Thr 


He 


Val 


Phe 


Ser 


Glv 


Glu 


Lys 


Leu 


i nr 


CjlU 


Ala 


Glu 


Ala 


Lys 










165 










170 










175 


Asp- 


Glu 


Lys 


Asn 


Arg 


Thr 


Ser 


Lys 


Leu 


Leu 


vj 1U 


Asn 


vai 


Ala 


Phe 


Lys 








180 










185 










190 




Asn 


Gly 


Thr 


Val 


Val 


Leu 


Lvs 


Glv 


Asp 


Val 


vax 


Leu 


O /-v -r- 


Ala 


Asn 


Gly 






195 










200 










one 
0 b 






Phe 


Ser 


Gin 


Asp 


Ala 


Asn 


Ser 


Lvs 


Leu 


Tic* 




Asp 


Leu 


Gly Thr 


Ser 




210 










215 










22 0 










Leu 


Val 


Ala 


Asn 


Thr 


Glu 


Ser 


He 


Glu 


L6U 


i nr 


Asn 


Leu 


Glu 


He 


Asn 


225 










230 










235 










240 


He 


Asp 


Ser 


Leu 


Ara 
245 


Asn 


G*i v 
vj - y 


Lvs 


Lys 


j. — e 
2 50 


L/S 


Leu 


Ser 


Ala 


Ala 
255 


Thr 


Ala 


Gin 


Lvs 


Asp 


He 


Ara 


He 


Asp 


Arg 


Pro 


vai 


val 


Leu 


Ala 


He 


Ser 








260 










265 










270 




Asp 


Glu 


Ser 


Phe 


Tyr 


Gin 


Asn 


Glv 


Phe 


Leu 


Asn 


pi 


Asp 


His 


Ser 


Tyr 






275 










280 










285 






Asp 


Gly 
290 


He 


Leu 


Glu 


Leu 


Asp 
295 


Ala 


\j i y 


Lys 


Asp 


lie 
300 


Val 


He 


Ser 


Ala 


Asp 


Ser 




Ser 


He 


Asp 


Ala 


V^ 1 
val 


uin 


Ser 


Pro 


Tyr 


Gly Tyr 


Gin 


Gly 


305 










310 










TIC 

-J X 3 










3 20 


Lys 


Tro 


Thr 


He 


Asn 
325 




Ser 


J. LXL. 


Asp 


Asp 
3 3 0 


Lys 


Lys 


Ala 


Thr 


Val 
335 


Ser 


Trp 


Ala 


Lys 


Gin 


Ser 


Phe 


Asn 


Pro 


■L il-L 


aid 


ulu 


bin 


Glu 


Ala 


Pro 


Leu 


Val 






340 










345 










350 






Pro 


Asn 
355 


Leu 


Leu 


Tro 


Glv 


Ser 
360 




lie 


Asp 


vai 


Arg 
365 


Ser 


Phe 


Gin 


Asn 


Phe 


He 


Glu 


Leu 


Gly 


Thr 


Glu 


vj X y 


AT-, 

M.-L a 


Pro 


Tyr 


Glu 


Lys 


Arg 


Phe 




370 










375 










q n 






Trp 


Val 


Ala 


Gly 


lie 


Ser 


Asn 


Val 


■Lieu 


nib 


Arg 


O a 


Gly 


Arg 


Glu 


Asn 


385 










390 










J 7 J 






4 00 


Gin 


Ara 


Lys 


Phe 


Arg 


His 


Val 




ui y 


PI w 

Vjiy 


Ala 


val 


Val 


Gly Ala 


Ser 










405 










*x X U 










415 




Thr 


Arq 


Met 


Pro 


Gly Gly 


Asp 


Thr 


Leu 


o er 


Leu 


Cjly 


Phe 


Ala 


Gin 


Leu 


Phe 






420 










425 








430 






Ala 


Ara 


ASD 


Lys 


Asp 


Tvr 


Phe 




Asn 


i nr 


Asn 


Phe 


Ala 


Lys 


Thr 






435 










440 










445 






Tyr 


Ala 


Glv 


Ser 


Leu 


Arg 


Leu 


Gin 


His 


As p 


rila 


oer 


Leu 


Tyr 


Ser 


Val 




450 










455 










460 








Val 


Ser 


He 


Leu 


Leu 


Gly 


Glu 


Gly 


Gly 


Leu 


Arg 


Glu 


He 


Leu 


Leu 


Pro 


465 










470 










475 










480 


Tyr 


Val 


Ser 


Asn 


Thr 


Leu 


Pro 






rue 


Tyr 


Gly 


Gin 


Leu 


Ser 


Tyr 










485 










4 90 










495 


Gly 


His 


Thr 


Asp 


His 


Arg 


Met 


Lys 


Thr 


Glu 


Ser 


Leu 


Pro 


Pro 


Pro 


Pro 


Pro 






500 










505 










510 




Thr 


Leu 


Ser 


Thr 


Asp 


His 


Thr 


Ser 


Trp 


Gly 


Gly Tyr 


Val 


Trp 


Ala 






515 










520 










525 






Gly 


Glu 


Leu 


Gly Thr 


Arg 


Val 


Ala 


Val 


Glu 


Asn 


Thr 


Ser 


Gly Arg 


Gly 




530 










535 










540 








Phe 


Phe 


Arg 


Glu 


Tyr 


Thr 


Pro 


Phe 


Val 


Lys 


Val 


Gin 


Ala 


Val 


Tyr 


Ser 


545 










550 










555 








560 
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Arg Gin Asp Ser Phe Val Glu Leu 
565 

Asp Ser His Leu Tyr Asn Leu Ala 
580 

Lys Arg Phe Ala Glu Gin Tyr Tyr 
595 600 

Asp Val 
610 



Gly Ala lie Ser Arg Asp Phe Ser 

570 575 
lie Pro Leu Gly lie Lys Leu Glu 
585 590 
His Val Val Ala Met Tyr Ser Pro 
605 
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89 
Claims 



1. Species specific diagnostic test for identifying 
infection of a mammal, such as a human, with Chlamydia, 
pneumoniae, said test comprising detecting in a patient or in 
a patient sample the presence of antibodies against one or 
more proteins from the outer membrane of Clamydia pneumoniae. 
said proteins being of a molecular weight of 100.3-89.6 kDa 
or of 56.1 kDa, or detecting the presence of nucleic acid 
fragments encoding said outer membrane proteins. 



2. Diagnostic test according to claim 1, wherein the outer 
membrane protein has the sequence as shown in SEQ ID NO : 2, 
SEQ ID NO: 4, SEQ ID NO : 6, SEQ ID NO : 8, SEQ ID NO: 10, SEQ 
ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ 
ID NO: 20, SEQ ID NO: 22, or in SEQ ID NO: 24, or a variant 

15 or subsequence thereof . 

3. Diagnostic test according to claim 1, wherein the nucleic 
acid fragment has the sequence shown in SEQ ID NO: 1, SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 
11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 

20 19, SEQ ID NO: 21, or in SEQ ID NO: 23, or a variant or 
subsequence thereof . 

4. Diagnostic test according to claim 3 wherein detection of 
nucleic acid fragments is obtained by using nucleic acid 
amplification . 

25 5. Diagnostic test according to claim 4, wherein detection 
of nucleic acid fragments is obtained by using polymerase 
chain reaction. 

6. A nucleic acid fragment derived from Chlamydia pneumoniae 
comprising the nucleotide sequence SEQ ID NO: 1, SEQ ID NO: 
30 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO : 11, 
SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, 
SEQ ID NO: 21, or SEQ ID NO : 23, or a variant or subset. - 
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of said nucleotide sequence which has a sequence homology of 
at least 50% with any of the sequences mentioned. 

7 . A protein derived from Chlamydia pneumoniae having the 
amino acid sequence shown in SEQ ID NO: 2, SEQ ID NO: 4, SEQ 
ID NO: 6, SEQ ID NO : 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID 
NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID 
NO: 22, or SEQ ID NO: 24, or a variant or subsequence thereof 
having a sequence similarity of at least 5 0% and a similar 
biological function. 



8. Polyclonal monospecific antibody against the protein 
with the sequence shown in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 
14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 
22, or SEQ ID NO: 24, or a variant or subsequence thereof. 

9 . A diagnostic kit for the diagnosis of infection of a 
mammal, such as a human, with Chlamydia pneumoniae, said kit 
comprising a protein with the amino acid sequence SEQ ID NO: 
2, SEQ ID NO: 4, SEQ ID NO : 6, SEQ ID NO : 8, SEQ ID NO: 10, 
SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16. SEQ ID NO: 18, 
SEQ ID NO:. 20, SEQ ID NO: 22, or SEQ ID NO: 24, or a variant 
or subsequence thereof . 

10. A diagnostic kit for the diagnosis of infection of a 
mammal, such as a human, with Chlamydia pneumoniae, said kit 
comprising antibodies against a protein with the amino acid 
sequence SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO : 6, SEQ ID NO: 

8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 
16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, or SEQ ID 
NO: 24, or a variant or subsequence thereof. 

11. A diagnostic kit for the diagnosis of infection of a 
mammal, such as a human, with Chlamydia pneumoniae, said kit 
comprising a nucleic acid fragment with the sequence SEQ ID 
NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO : 7, SEQ ID NO: 

9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 
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17, SEQ ID NO: 19, SEQ ID NO: 21, or SEQ ID NO: 23, or a 
variant or subsequence thereof . 

12. A composition for immunizing a mammal, such as a 

human, against Chlamydia pneumoniae, said composition 
comprising a protein with the amino acid sequence shown in 
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO : 6, SEQ ID NO: 8, SEQ 
ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ 
ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, or SEQ ID NO: 24, or 
a variant or subsequence thereof . 

13 . Use of a protein with the sequence shown in SEQ ID 

NO: 2, SEQ ID NO : 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 
10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 

18, SEQ ID NO: 20, SEQ ID NO: 22, or SEQ ID NO: 24, or a 
variant or subsequence thereof in diagnosis of infection of a 
mammal, such as a human, with Chlamydia pneumoniae. 

14 . Use of the protein with the sequence shown in SEQ ID 

NO: 2, SEQ ID NO: 4, SEQ ID NO : 6, SEQ ID NO: 8, SEQ ID NO: 
10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 
18, SEQ ID NO: 20, SEQ ID NO: 22, or SEQ ID NO: 24 or a 
variant or . subsequence thereof- in an undenatured form, in 
diagnosis of infection of a mammal, such as a human, with 
Chlamydia pneumoniae . 

15 . Use of a protein with the sequence shown in SEQ ID 

NO: 2, SEQ ID NO : 4, SEQ ID NO : 6, SEQ ID NO: 8, SEQ ID NC : 
10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 
18, SEQ ID NO: 20, SEQ ID NO: 22, or SEQ ID NO: 24, or a 
variant or subsequence thereof, for immunizing a mammal, such 
as a human, against Chlamydia pneumoniae. 

16 . Use of the protein with the sequence shown in SEQ ID 

NO: 2, SEQ ID NO : 4, SEQ ID NO : 6, SEQ ID NO: 8, SEQ ID NO: 
10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 
18, SEQ ID NO: 20, SEQ ID NO: 22, or SEQ ID NO: 24, or a 
variant or subsequence thereof in an undenatured form, for 
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immunizing a mammal, such as a human, against Chlamydia 
pneumoniae . 

17. Use of a nucleic acid fragment with the nucleotide 

sequence shown in SEQ ID NO: 1 SEQ ID NO: 3, SEQ ID NO: 5, 
5 SEQ ID NO: 7 , SEQ ID NO : 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ 
ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, or 
SEQ ID NO: 23, or a variant or subsequence of said nucleotide 
sequence which has a sequence homology of at least 50% with 
any of the sequences mentioned for immunizing a mammal, such 
10 as a human, against Chlamydia pneumoniae. 



SUBSTITUTE SHEET (RULE 25) 



WO 98/58953 

1/21 




PCT/DK98/00266 



WO 98/58953 PCTYDK98/00266 

2/21 




IF™ fi ° 



WO 98/58953 



PCT/DK98/00266 



3/21 



kDA 
97.4 




36.5 
31.0 



Fig. 3 



4/21 



PCT/DK98/00266 




Fig B 4 



WO 98/58953 




PCT/DK98/00266 



5/21 



kDa 1 2 3 4 5 6 

250 •V::: : ;;:;/;::x: ; :- : .:-..,:.. 



98 



64 



50 



Fig- 5 



WO 98/58953 



6/21 



PCT/DK98/00266 



PEX2 -18 

pEXM f 



h- \ 4410 

1093 



pE£3j36 ^ pEX2-10 

Pstldones 2188 3704 

I 1 

1227 

B8F3 

— 1 

4570 

omp5 omp4 

2740 2902 5688 



1 000 2000 3000 4000 5000 6000 



Fig. 6 



WO 98/58953 • PCT/DK98/00266 

7/21 



C. pneumoniae omp4-15 gene clusters 



omp12 omp11 ompio 
' * * ► ► 



omp5 omp4 omp13 omp14 
► ► ► 



2: 



omp6 



omp7 omp8 omp9 omp15 
► ► ► ► 



kbp 0 



10 



15 



20 



25 



30 



Fig. 7 



WO 98/58953 



PCT/DK98/00266 



o n co co o vd in i — u~> ro 
^31 ^r* ■^r i -^r ■^t* •'nT ^3* -^ji 



CD CD CM lO rO ^J* CO t — t VX) CD CO < — I 
CT\ CT\ CTi CT\ CX\ CO CT\ CT\ CT\ CO CT\ 



>|CL. 


ex. 


CL, 


cl, 


a, 


-1 


CO 


'1 


CL, 


-1 


co e-* 




6-t 


8- 


E- 


co 


CO 


1 


Da 


EX, 


tx, >- 


CX, 


IX 


>i 


IX, 


CX, 


H 


'1 


HI 


1 — 1 


21 Eh 


CO 


6- 


E-« 




E-« 


CO 




co 




O O 




CO 




0 


21 


E-» 


< 






1 E~» 


E- 


CD 


CL, 


CO 


E-« 


E~« 


2; 


2; 


CO 


S 2; 


CO 


O 


l 


Q 




CO 


0 2; 


CO 


E-« 


E-< 


< 


1 


i—* 


O 




CO 


h-H 


1 


2; co 


a 


1 


l 




DJ 




Q 


< 




O 0 


0 




O 


a 


O 


0 


CD 


O 


"1 


a q 


2; 


O 


CO 


2; 


l-H 


Q 


a: 


2J 


Q 


CX, IX, 




Cl, 


IX, 




(X, 


O- 


IX, 


>* 


H 


CO CO 

<|q 


CO 


CO 


CO 


CO 


fol 


CO 


CO 


0 


2; 


Q 


O 






Q 






Q 


M l 



5 El 



col CLi cl, cl, cl, 



pjjji-H pi -3 ,-3 ^3 „3 „3 .-3 3]»-H 

1 1 1 > 2; <c e-i 1 o 
1 1 1 > < < < 1 ^ cl, 

<c<o<coi-3C3<'<e- 

<Q>Htt l O> > 3cncL.cn 
e-i>co<corcco^:co*<a, 

< £h > h Ci, (J W I £h I C-> 

hcoco W2;cn< 1 Q 1 Cm 

S O S J I CD I 1 

CO CO _3 _3 CO 1 CO CO > 1 

J ^ J H J I CJ S CO I 

CL, LX, CL, CL, CL, 1 CO »-3 I 

i-i*<<>rf:<<e-,E-t&u< 

< [^3 kJ ^1^ >3 Z3] pu |^3 -3 -3| 

> co < e-« 
_3 cx, co e-» 
[co]> ofco] 

H J e H 

»-3 CL, »-3 »-3 



E-< w &n w > 

E-« CO CO CO CO CO 



I CO CO CO CO CO CO 



Cti J H H H 
> > ,-3 _3 > ^3 



I d_, »_3 »-3 »-3 Pn 



>| ^3 J Du tin 



CL,cocoaarccL,cooc:ooioc: 











t-H 


> 


-3 


-3 


k3 


f— 1 




ojco 


IS] 


CO 


CO 


CO 


CL, 


CO 




CO 



S S 33 S S S S S sl 1 



(NoomaiHo^inr*U)n^ 

H £1 ft Ol r-H r-H QU t-H C% CL. t-H t— t 

goooeeoeooee 
o 000 00 



0 


CD 


O 


0 


CD 


CD 


CD 


CD 






7771 
O] 


2; 




IS 










O 






*i 


0 




O 




0 


CD 


CD 


O 


CD 


CD 


O 






^3 








^3 


< 




-J 


I*. 


fx, 




IX, 


IX, 


(X, 


IX 


IX, 


IX 


H 


»— t 


E-* 


CO 




e- 


CO 




E-« 




E- 


IX, 


O 


i-3 




^3 


t-3 


.-3 




^3 


^3 


> 


^3 


k3 


Q 


CO 




O 






Q 


C3 


CO 


2: 


< 






0 


O 


0 






CD 


CD 


CD 


CD 








6- 


e- 


E- 


< 




CO 


CO 


< 






e- 


e- 


E-« 






< 


E 


1 


1 


1 
1 


1 


t 


t 
1 


1 
1 


1 


1 


•< 
O 


> 
t 


1 
1 


I 

Pi 




Q 




03 


1X1 


O OJ 




1 


(X 




£5 


CO 


CO 








t> 


^ 


2; 


53 


CO 


ex. 


IX, 


Cju 


tx« 


IX, 


IX 


IX, 


IX, 


IX 


t—i 


IX 


O 


O 


0 


CJ 


0 


0 




0 


CJ 


IX 


O 


CO 




CO 


CJ 


CO 


co 


6-* 


CO 


< 


> 


co 



1 I I 1 I I t I I I CO 

I eh &h &n Hco[eTjco ^ <[i^] 



HH^3^3JJ^3»— iO^vJQCr: 
<C<Cco<*<cl,co8-«<Cixico 
&^<E-iE-«^E-«CD[X1 1 Ot-H 
CDCOOCDCL,CDCDCDCOQ<: 



I CD CD CD CD CD CD CDI^ Q ^ Q 



CL t ^3^i;rf:<:CL J CL,CL,>> ( _3 

WjHHHfeIt,WHHH 

*-3&-*co>hcoix,k3i-hcopc:>* 

Eh 

> 
2: 



^>>>>>>> >|^-3 
Q q[^]q Q (X)r>lQ Q Ol 
CD CD CD CD CD CD 



CDEhQC0E-<E-*«£-<QE^.-3C0 



^-3 IX, IXl 



Ci4£-t»-H>2;COCO(X,CO»-HI-H 

J |>H >, >H >4 >H >7|>[>H >H 0| 



2; q fi^""^ 



8- E-« Eh E-4 

O 



CD CD CD CD 



CD £-< CO 21 Q 

<; i ,< < < 

Q 1 Q Q 2; 

W 1 CO < E-« 

O* 6-t Eh Eh Eh 



EhICD CD O 



< CD < 1X1 (X 

<C Q O CU CLi 

Q E-h > >» Q 

E-» < 2; O CO 

£h CO >h E- 1 t-3 



COEh<COCXCOCOCOCO I 6^ 
W 2: < I <C Er< I ^ Pi H h 



CNJ CO LO CTv t-H CD -^P LO r~- VO m -^Ji 
H a C3, t-H t-H d, t-H a^H t— H 

eoooeeoeooge 
o 000 00 



WO 98/58953 



9/21 



PCT/DK98/00266 



'CNcvir-^^rcno^oonooH 
nnrironcMcMnnrM^i 



oo cr\ cn cnj c — ^ V£> 
r-~ c — oo r- t — t — 



CO oo 
c — to co 



O CO PC Q g CO CO < CD I 
CX, CX, £X, CX,|EHj"g7jc 



I CXI 



cx, cm cu|q 



< < < < cv w h ^ o w e-< 

CO 
W 
O 





CX 




EX, 








EX, 


(X, 


IX, 




CO 




oc 


CO 


CO 


CO 


CO 


CO 


CO 


Eh t-H 


-3 


t-3 


t-3 


t-3 


1-3 


i-3 


-3 




t-3 


-3 




CO 


CO 


Eh 




>H 


t-3 


i 


<c 


Eh 


>H 


o 


CO 


CO 


CO 


CO 


CO 


CO 


Ll 




CO 


CO 


CO 


IX, 


[X, 


IX, 


IX, 


ix, 


Pm 


IX, 


IX, 


IX, 


(X, 


[X, 


o 


CD 


CD 


CD 


o 


cd 


Q 


cd 


o 


H 


]o 


> — ♦ 


Eh 


|co[^]co 


CO 


Eh 


CO 


CO 


co 


CO 




-3 


Cx-i (X, JX, 


IX, 


IX, 






1-3 


cx, 


Eh 


l 




1 


CO 


1 


CO 


Eh 


q£ 


e- 


CO 


eh 


CO 


Eh 


E-« 


♦J 


Eh 


-3 


> 


«< 


i-j 


i-3 


i 


1 


i 




1 


t 


1 


1 




i 


cx, 


i 


1 


I 


1 


1 


1 


1 




i 


i 


cj 


CO 


t-3 


h-i 


»-3 


-3 


.-3 


52; 


Eh 


Eh 


Eh 


-3 






CD 


< 










< 


Eh 


(XI 






CO 








CO 




O Q 


cx, 


> 


Q 




Q 






cd 


<C 




CD 




:> 


Eh 




*C 


•< 


< 




Q 


> 


Cv 


CO 


£h 




Eh 


en 


^« 


> 


txi 


o a: 


Eh 


CO 


> 




Eh 




Eh 




CO 


CJ 








CO 


CO 


CO 


Eh 


CO 


< 


cd 


o 


CO 


> 


> 


t-3 


> 




IX 




> 






»— 1 


CO 




< 


> 


< 






< 






CO 


< < 


> 


< 


CJ 


< 


f-H 


<c 


< 


< 


t— 1 


cd 


o 




CD 


s: 


cd 




CD 


o 


CD 


< 


< 


w 


< 


< < 


<< 


<c 


co 




1 


< 


1 


1 


1 




i 


1 


1 


1 


« 


1 


CD 


> 


t 


> 




t 


ac 


1 


< 


Eh 


IX 


Q 


Eh 


< 


Eh 


CO 


i 


£h 


1 




Eh 


CD 


< 


cd 


CO 


CO 


cd 


cd 


cd 


cd 


&H 


CD 


1X1 


CO 




CO 


CO 








<c 


< 


CO 


l.-H 


CO 


Q 




l-H 


o 


q q q a co s: a; 

















F] 


■ |o 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


E^js 




F\ 




CO 








i i< 


CO 




CO 


E- 




Eh 




O 


Q 




a cd 




CO 




i-3 


i CD 


CL, 


Eh 


< 


Eh 


Eh 


CO 


Eh 


fx, IX, 


IX, 


IX, 


►-3 IX, 


IX, 


IXj 


IX, cx 


IX, 


1 Eh 


£h 


1 


> 


( 


> 


> 


> 


»-q CO 


ac 


CO 


t-3 Eh 




Eh 




CO 


1 Eh 


Eh 






1 




< 


Eh 






C*4 


CX t-3 


^3 


IX, 


^) IX, 






1— 1 




I 


1 




CO 


1 


CO CO 


CO 


CO 


O CO 


Eh 


co 


CD S 


> 


I CO 


> 


1 


1 


1 






1 




cx, 




>h pa 




IX, 


tc o 


CD 


1 co 


CO 


1 


Eh 


< 


Eh 


cx, 


CO 



•-3 *-3 i-J i — i i—3 »-3 H 

CO CO CO CO CO -3 -3 

»-3 »-3 »-3 »-3 t-H [X, CJ 

Eh Si Eh CO CO CO 

>4 ^ co <; s 

<C Eh £h E-, o CD CD 

<: < <c < <|> 

CD CD CD CD CD CD CD 
CD CD CD CD CD CD CD 



CD 
CO 



CO 
Eh 
CD 
OS 



IX, 
CD 



s S 2: 2; S Q Q 
< i i Z i i i 
' • I S i i i 

" 1 I < I I I 

QtXitxiixiQ<(jqcDEH 
E^wcoS2;^>eh^ 
[cq]o [co cococorococococo| 
ct,>H<>[t l ti,<ci,o'>-'CK; 

« cnw aoo co q oo >h 

[ix, ujo ^L7pq >|73]HH|tx, IX, cx| 



JHJHUJJ>>> H 
> CD CD CD >h PC! 2Q>4WQ 



^s;coqcos;qq2;s2; 

Q Q CXI Eh Qf 1X1 O t-3 ^3 1X1 Eh 
IX, (X, [X, ^3[hH|^| ^} _g ^|»-H[iI7| 



^Eh>^cosc>coSSJ 



LJJ, 3T3|CJ > >| J -3|>fsl 



CD O CD CD CDlSlfoleH 



E- CD Eh <; e- < co b< t< _3 £h 
[w]cj hh |co co co"co] > prco] < 

1 X^CO«CO>S>H>H> < 
0>HJHJ^HJHH 
CO <C <C Eh <C Eh CO <C CJ < < 
> |CD CD] co |q CD O CD CD CP] cp 



CNCOLDCTiHO^l/lr. 
*-H O4 Q4 O4 ^ — I t— ( Qu T—t CU 

O OOO 



m 

' — s t— 1 

o 6 e 

o o 



w[cp]a 

^ DO, 

I Oh |-l 

1 -3 > 

' J s 

I I CO 

>-H I < 

Q I CO 

H ft a &,*-H t— I CUt-H q* CX. t-H t-H 

o 000 OO 



WO 98/58953 



PCT/DK98/00266 



o^coinoocNOojoocNHLn 

rsjfN(NlNC\CMCNl«HCN«-tCN 



OUiHCNOOOin(\HOOOC\ 

uz> vo vx> vx> lT) r-- co v-o vr> vx> 

CN(N(NCN(N(N(NIrH(MHCN 



Q < E-i E-» S 

CO CD Eh D4 < CO > 

I I I I I I I 

co < < o «< < <C 

£h H i< h H ^ 



s s s s s s s 



q s s S cu co pa 

CO CO CD S co J s 



Uj U Cl4 H, [L, tL, Ct^l 



CO 


i-4 


CO 


> 


CO 


OS 




:> 




_H 




< 


> 


1-4 


w 




t-H 


O CO 


>H 


E-h 


cd 


< 


O 


CD 




CD 


CD 


o eh 


< 


E-h 




Eh 


i-4 










£h 




r-»\ 

Z | 


cd 




w 


O 












CO 


CO 




< 


PJ 


h 


l-H 


J— 1 


t— t 


i — i 


1 — < 


. , 1 


e-h 


Q 


> 


CO 


6h 




E-h 




> 


CO 


<< 


t-H 




»-4 


< 


Eh 


a 




o 


o 


CD 


Q 


O 


cd 


< 


cd 


<c 


* 


CO 




CO 


CO 


Eh 


Eh 




6- 


< 


< 


CO 


co 


Eh 


E~* 


o cj 




>< 




< 


CO 


t — 1 


t-H 


i— i 




Li 




t — i 


<< 


< 


< 


< 


<c 




<: 


cd 


o 


cd 


cd 


cd 


CD 


CD 


cd 


cd 


cd 


cd 


o 


CD 


CD 






O -4 






u: 




be: 




O E-i 




1 


o 


1 


1 


i 








Eh 


t 


1 


i 


i 


i 


1 


< 


1 


1 






1 


CO 


CO 


CO 




en 


CO 


CO 


CO 


CO 


CO 




< 


CO 


CO 






CO 


CO 




CO 


E- 




2: 


53 


23 




55 




w 


CD 


CD 


CO 






o 


CO 


D4 


-4 


Eh 


< 


CO 


CD 








Pm 




H 


-4 


CO 




&H 


Eh 




t— i 


< 


i — i 


> 


HH 


_^ 


< 


<C 


CO 


CO 


Pm 


CO 1 


a 


CO 




o 


a: 


co e 


o 


^: 


Eh 


6h 


Eh 


e- 


t Oh CO 




eh 


co 


CO 




; is 


; Eh 




o 




cd 


o ^ 


J CD 


CD 


£h 




e- 


co co E-h 


O 


J CO 




cn 






> ^ 


LO 



CD S >h 

2 OQ H 



< 
E-" 

0 



P4 
< 

[1 

> 

0 

o» 

i-4 

a* 

CD 
CO 

co 



Cl4 

Eh 

o 

>H 

0 

CO 

> 

CO 

Q 

>H 

CD 
> 



CD 

Eh 

< 



Eh 
CO 

0 



Dm, 
< 

CO 

si 



i-4 

I — I 
< 

co 

CD 
Eh 
i-4 



CO 

I— I 

t 

I 

I 

CO 

CD 

"s 
"o 

E] 

^4 
^4 
CO 

^ 

Eh 

D4 



^4 
^4 

CO 
9h 

z 

Q 

Eh 

IX, 



CJ 










CJ 


Eh 


1 — 1 










h— 1 


^4 


<: 


CD 


•< 


Eh 


Eh 


<c 


Q 


CD 


CD 






CO 


CD 


CD 


CD 






w 


CO 


CD 


co 


CO 


i 


o 


o 


O 




CJ 




i 




>H 








Q 


i 


i— « 


1 — ( 


t-H 




i — i 


CD 






< 


< 






Eh 


CD 


CD 


CD 


o 




CD 


6h 


< 


CD 


CD 


CD 




CD 


CO 


Eh 


2: 


^ 


Eh 






CO 


<< 


Eh 


CD 


<: 






CO 


1 


1 


*< 


CO 




< 


< 


1 


1 


< 


Eh 


Eh 




CD 


! 


t 


O 


<c 


Eh 


CD 


Eh 


Eh 


Eh 


Eh 


Eh ^ 


CO 








< 


> 


< 


Eh 




CO 


US 


Eh 


CO f< 


^; 


I 2 




- 










Q 


CO 


CD 


CD 




CD 


o 




CO 


Q 


CO 


b — 1 




CO 


I- 






Pu 


Pt4 







CO > Du, CO >* l-H 

> J H J H ^ J 

« cn S ai < w 
< H ^ ^ a: 

|g S ^ S ^ g] 



o 
c-> 
> 

CD 

o 
b^ 

CD 

t-H 
< 
Eh 

Eh 
i-4 
< 
D4 

ID 

w 
0 

> 
< 

0 

o 

Q 



CO Eh CO Eh CO CO Eh 
l-H l-H l-H (-H l-H _3 £jl, 
Eh Eh > CO tin »-H Eh 
> CJ -4 CO CO CO CO 
COSSCOCOEh<CCOQ 

WE<WUWUO<Q£XU 
EhCO^Cl.EhQiJCJCOEhEh 

HH t-H KH[^4| HH hH|_q[7H[^4[7H >h 

"<]CD CD O CD CD O CD CD|«< < 



a: 
co 
co 
z: 

>H 

> 

0 

> 

O 
co 
^4 

0 

O 
co 

Eh 
-4 

s: 

CO 
CD 



h a o, a<«-t t-H d. ' — i a, cxr-i ^h 

eoooesoeoogs 
o o o o o O 



[ CD CD CD O Q CD CD CD O CD CP| 

«-! & ft Cl. ' — < t-H Cmt-H OU CXt-H t-H 

eoooeeoeoose 
o o o o o o 



WO 98/58953 



11/21 



PCT/DK98/00266 



oncnoocMOOOcnoooouDcN 

LO LD kD LT> «g( m 

m m m m ro m m csj n cn cn 



CO 


CO 


CO 




CO 


CO j 


ld|co 


col 
co| 


r , 






CO 


CO 


CO 




t-3 CO 


CO 


r , 
fcH 






1 — 1 




»-3 




"[ 




..r 
~l 


151 

rj 


pq 


£h 


> 


Q 


> 






Eh 


CO 


Q ^ 








i-3 




*— 3 




•J" 


t -J 


^1 








CD 


2J 




CO 


2; 






St, 




i< 


3C 


Q 




< 


*S 


oc 


ac 




< 


E- 


<C 


e- 




< 


Eh 


<C 


£-< 


>H 






O 




DC 


CO 


rr: 


>~« 


CO 




















7> 






*c 


<C 


-< 


.< 


< 


< 


<c 


<c 


o 


cd 


o 


CD 


o 


O 


o 


cd 


cd 


cd 


cd 


cd 


o 


CD 


o 


o 


cd 


cd 


o 


o 


< 


2; 


CO 


O CO 


CO 




pc 


o 




Eh 






Eh 


E-« 




O £h 




cd 


e-« 




cd 


> 


l-H 


Cx3 


cd 


CO 


1— 1 


o 


CO 




< 


CO 




< 


CO 


CO 


CO 


CO 


Eh 




> 


Eh 


CO 


35 


^> 












2; 


21 


2; 






a; 


O 






CO 


o 


cd 




o 


CO 


CO 




< 




< 


l-H 


h-i 


CO 




1- 


IX, 




. IX 


• tx, 




> 






[X, 


t-3 


Eh 




: 6- 


* 25 


n 


21 


CO 


> 


> 


E 


]> 


H 




1 *-3 


i-3 


l-H 


.J 







SCOCOCOCxJE-iCOCOO^ 

CVOfW^CD^Q^^oC 
CDCDCDCDQ2;CD2;QCD 



M 


CO 


CO 


CO 


CO 


co| 


E-h < 


CO 


CO 
















Pu 




E - * 


Eh 


Eh 


E-« 


en 


l-H 


co 






> 


> 




_3 




*-3 


t-t ^3 


> 


> 






t— I 


E- 


> 


w 


6- i 




CO 


eh 


< 


cu 


CL, 






O i 






Q 


Q 




E-< 




CO 


e- ( 


> 


p£ 


8h 


CD 




CO 




o 


CO 1 




CO 


CO 


CO 


21 






CO 


CJ I 


Eh 


< 


Eh 








<C 


< 


j i 


CO 




^ 


< 


cd 


O 


CO 




< i 


CO 


CO 


>H 










E-« 


O ) 


O 


o 


< 












O i 


CO 


i-3 












1 


Q i 


E-« 














1 


O » 


cx, 














1 


S " 


i-3 














1 


E-i i 
















1 


CO 1 
















I 


S i 
















1 


W I 
















1 


O l 
















1 


J 1 







CNooin(T>HO'Cinr-^on^ 

H ft ft ft I f — I ftt-H ft ftr-l r— ( 

ft e e e ftftgfteeftft 
eoooaEoeooee 
o o o o o o 



i t i i i i i i i 

i > CO E-h > E-« 6-« i i t 

o o o o cd|ce:|u o o o 







W 




w 


> 


Ci3 


U3 






Cl. 


Pu 


Cjl, 










Cx-, 


-1 


> 


e- 


»— « 


E-» 




E- 


> 


> — i 


e- 
























Q 


o 


2: 


O 


-I 


E 


Q 




Q 


O 


o 




O 


o 


o 


CD 


CD 


CD 


CD 


CO 


< 


i_3 


o o 


E- 


O 


CO 


^ 


CO 


Q 




Q 




t-3 


Ci3 


O Q 


03 




< 


< 


<C 


< 


< 


<C 


Eh 


CD 


"< 


> 


CO 


CO 


CO 


CO 


CO 


CO 


Cju 


CO 


CO 


CO 




i-3 










- 




1 1—1 


CO 


CO 


CO 


CO 


CO 


CO 


;o 


CO 


CO 


CO 


t-3 




1— i 






-3 




^3 


KH 




Cx3 


Cz3 


W 


CO 


CO 


Cx3 


CO 


e-« 


Cx3 


CO 


|o 


O 


O 


o 


o 


o 


O 


CD 


CD 


O 


CO 




CO 


CO 


CO 


CO 


O 


CO 


O 


CD 


Q 




Q 


Q 


Q 




E-« 


Q 


^H 


cm 


Cx3 


i-3 




< 


< 


Q 


53 




Q 


< 



<[co[^C < < |co cxh,| <; <c < 





*< 






< 




< 


•< 


<; 




CD 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


O 


CD 


CD 


O 


O 








^ 


i—3 




-3 










CD 


i 


CD 


Oh 


CL, 


CL, 


CD 


2; 


CO 


< 






<< 




t 


&h 










&H 


<C 


< 


Eh 


<c 




CO 


> 


CD 


€-» 


< 


Eh 




<: 


re 


Eh 


2: 


Q 


CD 


O 






Q 


CO 


Eh 


Dl, 


.< 








CD 




CO 


> 


<c 


>• 


>H 


> 


> 


<c 


CJ 


< 


> 


> 


£-h 


CO 




CO 


i— I 




PC 


CO 




2: 


2: 


»— c 








2: 


2: 


2: 




2: 


lo 




2: 


PC 




2; 






oc 


CO 


oi 2: 


2: 


CO 


CO 


CO 


CO 






cx, 


UX, 


1 


txi 




{X, 


Cu, 


IX, 


IX, 


cx, 


t-3 


•-3 


ClH 






CO 






i-3 


w 


I 


Eh 


^3 


o 




t— i 


^3 


e« 


Eh 


E- 


i 


Ci3 


Eh 


CO 


»-3 


CD 


> 




CL, 




< 


CD 


CL, 


CL> 



O O CD 



CD CD CD I ( IcDj \ 



I CD PC 
I CO CD 



0* i 
CD 




r-t ft ft ft r— I t-H ftt-H ft ft t— I I 

fteeeftftEftgeftft 
eoooeeoeooee 

o o o o o o 



WO 98/58953 



PCT/DK98/00266 



12/21 



oo>o\o\cy»oOrHocrtovD 



Q Q 


Q 


a 


Q 


Q 


Q 


Q i 


a 




Eth 


i 


CO 


CO 


l 


< i 


£h 


> CO 


l 


CO 




CO 


CO 


> Eh 


r-H 


E 4 Q 


I 


< 




1 


<c 


CO CO 


^ 


EH < 


I 


o 




t 


CD 


Eh CD 




CO #< 


< 


eh 


EH 


EH 


EH 


CO Q 


CO 


CO EH 


CD 


en 


DC 


CD 


Q 


CD f< 


CO 


cd 2; 


CO 


2: 


21 


cu 




Eh CO 


CO 


EH «< 


CO 


CO 


EH 


CO 


Eh 


> CO 


DC 


£H EH 




eh 


EH 


EH 


Eh £h 


CO 


Oh Oh 


cu 


PL. 


Oh 


a, 


Oh 


Oh Oh 


Oh 


a q 


Q 


Q 


Q 


Q 


Q 


Q Q 


a 


>M >H 


>H 


>< 


>h 


>H 


>i 


>H >H 


>H 


ClH Oh 




Oh 




C^i 


Uh 


ClH &H 




>-« Cx-i 


B 




>H 


ClH 


>- 


















> 


CO 




CO 


EH 


eh 


< 


Eh CO 


CD 




2; 


OS 








CO 


; cu cd 


CD 


CD 


» 




o 


O CD 


cu 


CO 


aouiaoz co 


< n 


EH 


^: 


CO 


< 


CO 


<C Oh 


> 


coj< 


<C 


«< 


.< 




<=*: 


< < 




OS Pi 


Oh 


Oh 


Oh 




0 




<* 






-3 






1-3 




•-3 








o* co 


CD 




2; 


EH EH 


-H 


EH 


> 


EH 


Eh 


EH -3 






Oh 


1—1 


n 




3: 


t-H &H 


tx, 














Nh Nh 


o 


< < 




■< 




[51 


« 


*< <c 


<| 


1 £H 


2; 


co 2; 




EH 


i 2; 


CD 


1 CO CO 


CO 


CO 




CO 


CO 




EH 1 


1 


t 


2; 


1 


1 


EH ^ i 


CD CD 




CD 


CD 


CD 


CU 


CD l 


O 








r-H 










Q Q 




>H 


S3 


2J 


ac 








CO CO 


< 




CO 


< 


> 


< CD 


CO 



co[s 2: 

[^4 Oh Oh 



2: s 2; 53 

Oh P< PCi 



3 fc< EH 
E-Eh> 

CD EH CO 

Oh O < 

EH Oh Oh 

Eh I CO 

CO EH CD 

EH EH EH 

I < EH 



cm oo in cn 

0* e e e 

6 o o o 
o 



CO 



2: 2; 



EH E-i ^ 

« EH Oh EH 

CO EH EH EH 

EH O Q < 

Oh co Eh 21 

<C CO CO Oh 

CO CO CD <C 

EH <C EH EH 

CO CD EH Eh 



X ^ X 

CO Dh I — I 

> Oh EH 

Eh k4 2; 

O CO CD 

EH EH Q 

< Oh CX 

CD i co 

i i < 
i 



i 



i 



eeosooeg 
o o o o o 



t — 1 
un 
^ 


r— 




^ 


CO 


m 




t_n 
tn 
rn 


m 


in 
m 


Eh 


2; 


< 


EH 




CO 


EH 


>• 


EH 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


CD 


o 


CD 






CO 


CD 


CD 




2: 


< 


CD 


CO 


Eh 


< 


< 


< 


CO 


> 


^H 


CO 


<C 




^H 








*J 




t*H 






EH 


Eh 




<: 


EH 


CO 


EH 






EH 


> 


> 


> 






> 


:> 


> 


> 


> 


Oh 


Oh 




Oh 


Oh 


Oh 


Oh 


2: 






oaooooo'O'O'o 


• — ^ 




Eh 




O EH 


2; 




Oh 






» — -3 


ClH 






tx, 


i — < 




t-H 


t-H 




EH 


EH 


l—i 


Eh 


CO 


oc 




EH 


EH 


CO 


CO 


CO 


CO 


CO 


CO 


CO 


CO 


CO 


-1 


EH 


EH 


t< 


EH 






EH 


Eh 




EH 


^3 


»— ] 




Cu 








<* 


CLh 


i—3 




2: 


2: 


2: 


2: 


2: 


2: 


2; 


o 


2: 




Q 


Q 




Q 


Q 


» 




tx; 


W 


CO 




Oh 


< 


< 








Oh 


<: 


Q 


> 




< 






i 


Q 


Q 


o 


< 












i 




2: 


i 










< 


•< 




> 


CU 


w 












w 






EH 


Q 


w 


Q 


<: 


Q 


< 




CO 




CO 


CiJ 


cq 


< 


ClH 


< 


Eh 


w 


1 


a 


CO 


CO 


CO 


CO 


CO 


EH 


co 


Eh 


1 


Q 



co co 

CD CD 

e^jco 

Cxh Cxh 



-4 J 

CO CO 

CD CD 

CO CO 

Ph Cl, 

> > 



^[Eh" 



^H 



Oh 



CO CO 

CD CD 

CO CO 

ClH ClH 



CD 



Uh CO CO 

CD CD CD~CD 

CO CO CO CO 

Uh Cju tx* Ijm 



]>>F 



CO 







CO 


>* 


EH 


EH 


EH 


CO 


EH 


EH 


EH 


CD 


CD 


CD 


CD 


O 


CD 


CD 


CD 


CD 


CD 


EH 


CO 


CO 


CO 


O O* CO 


Eh 


EH 


DO 


>• 


>H 


>H 


>H 


>H 


>H 






>H 


>H 




EH 


Q 


< 


Oh 




CO 


CO 


EH 


l-H 


EH 


2: 




Oh 


2; 






CO 




CO 


CD 


Oh 


2: 


♦ — -3 


1 


2: 


2; 


t< 


CO 


2; 


CO 


CO 


CD 


< 




Q 




CD 


o 


CD 


CD 


2: 


< 


CD 


2: 


CD 


Oh 


CO 


<< 


<C 


< 


CO 


_H 


<C 


< 


Eh 


Q 


Oh 


Oh 


Q 


Q 


Q 


Q 


CO 


co 


Q 


CD 


< 


EH 


< 




Oh 


Oh 


CD 


> 


Oh 


2: 




CO 






O CD 


53 


CO 


CO 


r-H 


r-H 




2: 


2; 


2: 


2i 


2: 


2; 


2; 


2: 


1 


> 






l-H 






r-H 


r-H 




1 




2: 


co 


EH 


2; 




oi 


2: 


o 


I 


•-H 




H 






l-H 






■ 


> 


8duio 


! gduio 


omp9 
ompll 
omplO 


o e 

o 


vjo ro 

ESQ, 
O O E= 
O 



WO 98/58953 



PCT/DK98/00266 



13/21 



> O C — -^T" 

o cn cn 

LO ^ -^J" 


CF\ 

■^r 


CT\ OO 


t-H 

en 


lo c-- 


m 


Q CD Q 


CO 


^ CD 


>• 


ct 


CO 




*-h|-3 -q 


-3 


►J J 


t-n 




-3 


i 


| CO CO CO 


< 


CO CO 


Eh 


CO CO 


I 


co|q q 


CO 


Q Q 


Q 


a 


Q 


co 


[H > KH 


t-q 


> >pC]HH hH 


t-H 


3 cl, is 




s 2: S £2 Q 


.J 




> CO *< 


> 


> CD 


CO 


co 


< 




J J J 






CO 




►J 


-1 


s o o 


^ 




£5 


^ 


CD 




s; Eh s 


Eh 


S £-. 


Eh 


Eh 


£h 


o 


(-1 j j 


i— D 


t-H t-H 






t-H 


> 


Eh £-. £-« 


CO 


Eh Eh 




CO 


< 


oc 


CO > > 














Er-i CO CD 


< 


CD co 


Q 


CO 


o 


<: 


|Q U W 


CO 


Q CD 


co 


CO 


CO 


°l 


< eh < 


Eh 


< <C 


Eh 


Eh 




CD 


l CO CO 


Q 


t Eh 


CO 




CO 


CO 


CX- <C <C 


<: 


EHh< 


< 


<c 


o 


co w 




CO CO 


o > 


t-H 




|-q _3 Cl, 


t-q 


-q ^ 


•— 3 




»-3 


^1 




Eh Eh 


Eh 


Eh 


Eh 


Eh 


Eh 


Eh 




CD 


CD CD 


CD 


CD 


CD 


o 


> < cd 


CL, 


<C CO 


i-3 


•-J 




(X, 


Q Q Q|ojQ Q 

s s s: s: s s 


Q 


Q 






*-3 








COt-H > 


-J 


^ CD 


-q 


t—i 




Eh 


0>> 


■J 


»-3 J 




-J 






oc: co m 


Eh 


Eh J 


Eh 






Eh 


|co CO CO 


CO 


CO CO 


CO 


CO 


CO 


CO 


Q CD CD 


CD 


CD CD 


CO 




CD 


CO 


«< «< CO 


co 


cl, < 


ou 


< 


CL, 


< 


O Eh > 


Eh 


CO CO 


CO 


Q 


CO 


Cx3 


acxocooiaoa 




^ h h 


Eh 


CO CO 


CO 


CO 


Eh 


^: 


|cl, pl,]»-h 




&-I IX. 






Cl, 




< O En 


O 


CO CO 


o 


CD 




CO 


o ^ a; 




J< Eh OS CO K 


Eh <C 


> 


<C CO 


Eh 


< 


> 


t-3 


CXQ > 


Q 


> co 


t-H CO 


Eh 


O 


-q ,-q > 


-q 


-4 -q 


_q 


-q 


> 


.-q 


EH £h Eh 


wIEh Eh 


ehT 


> 


Ci3 


Eh 


>>>>>> 


> 


> 


< 


> 


CD CD 0|S| 


CD CD 


CD 




CD 


CD 


s o; q u 


CO ^ 


O CD 


CO 


Q 


^ ^ ^ 




J J J 






CO 


-q -q _q 






> 


co 


to > > 




CO J 


to 


> 


•J J J 




i-3 .J 




> 






oo t_n en 

a a 
see 

o o o 


ompll 


o lt> ( 

rH (2L, t-H 

e o g 
o o 


omp/ 


Cl| T-H T 

g a 
o g 
o 



i_n L/i in in in i_o ^ lo ^ cnj 



•J ^-3 Dl, j|hh ^» j j j 



Cx30CO>EH^fc£;ci3CO ( J 

J>H>,J J> J JQ, 

QCOCL,CXiCOCOCOCDCLiE-i 
1 ' t I I I I O i Cl, 
O^Cl, Dt« >i >. U, 



>h Cjh. Cl, 



COQ>Eh>JQcoEhCi3 

o o c>|^[oI[^ 



3' 



^ E- 0< Q CO Q CO [i. 
KUCOSScooiSttCL 



COQSEHCOSEHCX.piQ 

SS3GCOQCOS^Q^> 
Ci3 Cx3 Ci3 CU ClJ W Cxjlodwico 
>->h>«>h>^>^>h>. s] 



IX« 




>H 


Cl, 


> 


t-H i_q 


Cl, 


< 


>H 


Eh 








2= 


S << 


CO 




< 


|o 


CD 


CD 


CD 


CD 


CD CD 


|*3 [tD] W 


Eh 


O O CO 


CO 


CO Q 


Q Eh Cl, 


CL, 


2; 


•< 


CO 


CL, 


»-h <C 


CO 


CL, 


< 


Q 


Q 


Q 


Q 


Q 


Q S 


t-H 


CO 


> 


«-3 




> 


O > 


»-» > 




t-H 








i-3 


Cl, 


i-q 


^3 -J 


-3 


>-3 


-I 


Eh 


-J 




> 


CO 


S <C 


> 


CO 




t-H 


t-H 


t-H 




»— 3 


»-3 *-3 




t-H 


CO 


Eh 


CL, 


CL, 


CL, 


CO 


fc<5 Eh 


CL, 


CO 


^3 


CD 


CD 


CD 


co 


CD 


CD CD 


pc: 


Q 


CO 


co 


CO 


CO 


Eh 


CO 


CO £h 




Eh 


> 


H 




•J 










> 




Eh 


< 


< 


Eh 


Eh 


»-« Eh 




CO 


< 


•-3 


> 


> 


t-H 


> 


> »-H 


»-H 












Eh 


Eh 




Q 


O Q 


H 
















CO 


CO 


CO 




CO 


co s 




Q 










< 


< <c 


<c 


< 


CO 


< 


< 




CD 


OCX 


Eh 


Eh 


< 




CO 


Eh 


< 


Eh 


^ CO 






or: 


Eh 


< 


<C 




< 




< 


< 




W 


< 




W 




Eh > 


CO 




oc: 


h 


t-H 


t-H 


t-H 


h-3 


CO s 






H 




> 


t~H 


CO 


Eh 




> 


CL, 


i 


> 

1 


< 

1 


> 
1 


< 
1 


> • — ' 
1 1 


t-H 
1 


< 


> 
Cl, 








CO 




CL, CL, 




Eh 


3; 


t< 






ixZ 


X 






CO 


Q 


<: 


CD 


Eh 


23 


Eh 




CD 


CO 


Eh 


o 


W 


CD 


CD 




•-3 CD 




CO 


Q 


omp 8 


omp 5 


omp 9 
ompll 


omplO 


omp 4 
omp 15 


r— 
cu 
g 
o 


omp 6 
ompl3 



WO 98/58953 



PCT7DK98/00366 



14/21 



cr> CD * — i csj i — < m o m vx> tn csi 

<T\ OA OO OA CT\ OO r-H CT\ C — CT\ UD 



C3 <o t — < vo cni , — i ro CD m 



< 


< < 


o 


< 




<C <| 


o 




E- 


Eh S3 


CO 


< 




O fc< 


> 


cc: 


Eh 


^ ^ 








CO 








co 


Eh Eh 


<c 


CO 


Eh 


Cm 


Q 




Q 


< 




Eh 




55 


O* Q 


o 




o 


Cm <C 


CO 


Eh 




Eh 


Eh 


Eh 


Q 


eh 


Eh ^ 


E-h 


1 


i 


O 


CO 


O 




o 


CO »< 


I 


1 


i 


i 


1 


I 


CO 




< i 




< 


< 


i 


1 




Cm 


> 


E-« l 


i 


Eh 


Eh 


Eh 




i 


Cm 




Q O 


l 


a 


Q 




1 




>h 


Cm 


Q Q 


Q 


Da 


Eh 


Cm 


1 


i 


pq 


o 


> > 


< 


CM Eh 




1 


Eh 


pq 










3= 


l> 


2£ 


3= 


si 




Eh > 


Eh 


CO 




o s 


< 




i 


S HH 


< 




> 


> 


i— i 






i 


O t— i 




< 


55 




Eh 






l 




I s 




3: 










I 


Eh E- 


rr 




o* a: 








O 


o o 


o 


o 


o 


o 


O 


O 


rol 
C£J 






>- 


>H >H 


>• 


X 


Pm 


>H 


>H 






o 


O O 




O 




o o 


o 




>H 


>H >H 


X 




>H 


>H 


>H 


>H 


Q 




re ac 


n: 


rc 












Eh S 


cu 


i— 1 


CO 


Cm 


CO 


Cm 


pq ' 




Cm E-< 


w 


Cm 


55 


Pi 


CM CO 


Eh 


w e-< 


Cm S 


Oh CO 


> 


> 




CD < 2 


Eh 




a 


Cm 


< 


Cm 


>. 




: > -a 


Of Da 


pa < 






O* 


I— 


1 £h 1— i 


> 




< 


. > 


1—1 


Pm 


Eh 


CUi Cti o. 


. a. 


Ol, 


a. pa i 


1 


Pm 


c 


t 1 1 


CO Q 


> 




1 


1 


PQ 


r 


1 1 


E- 




1 


i-3 


i 


1 1 




1 < 


i— 




t 


1 


Eh 


i 


> e 


i ^ 






1 CM 1 


1 


Eh 


cu < c 


\ <£ 


: q 


CO co co 


Q 


W 


e-i a, a. c 


1 Eh 


CO <c 


: ^ 


O 


o 


> > i- 


1 1- 


H 1— 1 


1— 


< Q CO 


< 






Q CM Q 


Eh 




E-» Eh Eh »— « »— i 


> E« <C 


> 


t-J 


CO Eh E-i Q ^ 


55 »- 


4 CO 


Eh 


CO 


E-* E-* £-« cn i 


Eh *- 




> 


Q 


I 


I 1 


t 


1 


Q E-< > 


CO 


Pm 


> < s < <c 


> O HH 


O co 


£h e-» e« < cl, 


Q O Q 


O Cm 


O U U E-» Q 


< < « 


< 


O 


CO _a Of <C O 


Q O U 




< < <c < < 


> Cm < 


CL 




1 W CO W £h 


Eh CO Q 


Cx3 PlJ 



< o 
oc cr: 

Q Q 
O CO 
CM CJ 

O Eh 

pa j 

<C co 
Eh a: 
pj pa 

J > 

>H O 



Eh pr; 
CO O 
CO Ol 

CO >< 

CO CO 
CO S 
Eh < 

co O 

C< CO 
Q Eh 

CO O 



o o o 

CM Di 
Eh ^ pj 

w x o 

00 H 

Cm Eh O 

> <C CO 
^ O CO 
Eh f-H S 
< U3 > 

> > 33 

J J H 

01 o Pa 



as o 

i< CC 
PJ Q 

>h £h 

Cm CM 

<C J 

O CO 

pa co 

Eh <C 

o 

Eh 

pa Pa 
h-t o* 

Dm > 
S CM 



x|o OOP o o o|s 



JHHJHJH[UJ 
CO«<COEhCOCO*<CmCO 

co o4pm pm cc: pc; pc; os cc 



l-H 


l-H 


> 


♦— < 


> 


-1 


0 




Q 


Q 




Q 




Q 






1— 1 


CO 


> 


Eh 


> 


Eh 


> 


t-t 




Pm 


Pm 


Pu 


Dm 


Pm 


Pm 


H 



<cowcow>wcos 
o|<|o o o o o 

[S 5: S S S S S S S 



COCOCOCOEHEHCOi-ai— \ 



2: 






Q 


55 




55 








Cm 


Cm 




Cm 


< 


-1 


Cm 


Cm 


Cm 




> 


> 


> 


> 


> 


> 




> 


> 




-M 


•~a 


^1 








•J 


»-a 


^a 


Cm 


CO 


Cm 


Cm 


CO 


Eh 


< Pm 


Cm 




Eh 


o 


o 


O 


< 


O 


CO 


o 


< 


o 


CO 


> — 1 


o* cm a: 


PS 




ex Pa 




^ 


PC 


PC 


Pm 


Cm 


Pm 


Cm 




CM bM 


1— 1 


pa 


pa 


pa 


PJ 


W 


pa 


pa 


pa 


Pa 


Da 


Cm 


Cm 


Cm 


Cm 


Cm 


Cm 


Cm 


<] 


Cm 


Cm 








55 


s l 


[w] 




Eh 


~PC 




Cm 




Dm 


Cu 


Cm 


Cm 


Cm 


Cm 


Cm 










55 




> 


-a 






Eh 


>H 


>H 


>H 


>H 


>H 


Dm 


>H 


Pm 


>H 


^a 


o 


o 


O 


CD 






C3 


CO 




Dm 


Eh 


Eh 


Eh 


Eh 


Eh 


Eh 


Eh 


CM H 


> 


^C 






Eh 




^ 


PC 


^S 




Eh 


Eh 


Eh 


Eh 


> 


Eh 


Eh 


> 




Q 


^M 


















3= 






< Eh £m £m 


Eh 




CO 


U4 


Eh 


Pm 




-a 






l< 




> 


Dm 


Cm 


Eh 


E-« 


Eh 




Eh. 


Eh 




E-« 


pa 





t-H Cm Cm Cm *-H ftH Cm CU t—i 

a.egeouCMecMeea.Q, 
goooggoeooee 
o o o o o o 



oi co in cr\ r—i q lt> c — vo ro 

H a ttrH t-H ClfH Cm Q<iH t-H 

aeeecMCMgcMeeaa 
eoooeeogooge 

o 000 00 



WO 98/5*953 



PCT/DK98/00266 



15/21 



LO 
CO 


CD 

cr\ 

v*o 




vo 
oo 


cn 


t— ♦ 


m 
oo 


cn 
<z> 
r-~ 


m 

CT\ 
LO 


U3 

r- 


Cm 


Cm 


Cm 


Cm 


Cm 


Cm 


Cm 


Cm 


Cm 


Cm 


•-3 


-3 


-3 








-J 








o o o o» o 


°1 


U 

1 


O O O 


o 


O 


CJ 


CJ 




CJ 


o 




*■■*« 


CO 


Cm 


Cm 


Cm 


Cm 


Cm 


Cm 


Cm 


Cm 


Cm 




<: 


< 


< 


<C 




«< 




< 


CD 




&L, 


< 


Cm 


Cm 


^> 


■< 


Cm 




»— i 


"21 




CO 


CO 


2; 


CO 






53 


CO 


CO 


i-3 


•J 


i— i 


Cm 


Cm 


t-H 


Cm 


> — 1 




e- 


O t— ( 




Cm 




.J 






E- 


>- 


Q 




2; 






s: 


Q 




Q 


ac 




Q 


cu 




Cx] 


Q 


Q 




o 




< 


CO 


CO 


CO 


CO 


CO 




CO 


CD 


Cm 


ex, 


O 


o 


< 


Cm 


<: 


Cm 


Cm 


Cm 


Eh 




^ 




£h 


Q 


-a* 




*C 




i — \ 




ac 


O Cm 


Cm* 




ac 


ac 


PC 


W 


< 




*C 


Cm 


< 


E-« 




£h 


Eh 








<: 


CD 


CO 




CO 


CD 


CO 


2; 


l-H 


o 


cd 


CD 


o 


< 


CD 


> 


*c 


> 




o 


cd 


CD 


O 


o 


CD 


CD 


CD 


CO 








R 


> 


a> 


»— ( 


> 


> 


-J 


< 


> 


< 




l-H 




> 


•J 


> 


> 


>H 


>H 


>< 


>< 


>H 








[«] 


F| 


cd 


cd 


cd cd 


o 


o 


cd 


CD 


CD 


CD 


cd 


cd 


cd 


< 


>H 


< 


CD 


> CD CD 


CO 


CO 


CO 


CO 


CO 


CO 








CO 




»-3 


« 


CO 




*— 1 




CO 


> 




3d 


3= 


DC 


ac 


a: 


ac 


ac 


a: 


ac 


ac 


cc* 


oC 


PC 


PC 


Cm* 


pc 


cm* 


cm- 


a: 


ed 


5m 


Cm 


>H 


>< 


Cm 


Cm 


Cm 


>H 


Cm 


>• 



PC CD 

S PC 
Q E-« 
W ^ 
Eh 
CO 
Q 



U£ CO <C 

oc; « o 

^ O s 

CD CD CD 

« « W 

^ o ^ 

Q Q Q 

>C W ^ 



CD CD 

»-< S 

Eh O 
CO CD 

a t-t 



cd oc 
x as 

W 2 Q 
23 W Cm* 
H ocj CO 
fc< CD < 
Q CO CO 
Pi fm* > 



Cm »-3 t-3 Cm 
Cm Cm Cm Cm 
25 sfQls 



Cm i-q 
Cm Cm 
S 23 



J J ti, 
Cm[>]cm 

S S 2; 



CO 


< 


< 




CO 






CO 


CD 




> 




Eh 


t— I 






l-H 


h-i 


CD 


CD 


o 


CD 


CD 


CO 




CD 


O 


< 


*< 


CO 


CO 


W 


CO 


< 


< 


Q 




< 


> 


< 


CJ 


> 


CD 


> 





Cm Cm J 



JpH^CmJ>]cm 



cnj co m cj> <— t o> en r-- vx) m -^ji 

a. e & s a a £ a. e e a p, 
goooEgoEooee 
o o o o o o 



to 
oo 


^T 1 

r-- 


m 


LO 
CN] 


LO 


CM 

r- 


CO 


CM 
LO 

r-- 


cn 
vo 


VJ3 

r- 


CO 


Cm 




CO 




w 


Cm 


CJ 


CO 


Cm 


k— 1 


> 


CO 






CO 


Cm 


<c 


> 


PC 


W 


>H 


CD 


CD 


Q 


CD 






>H 


PC 


CO 


CO 


Cm 


CO 


Eh 


Cm 




CO 


Cm 


CO 




^1 






M l 




l-H 






1 


> 


CO 




I-H 


CO 




>H 


< 




►J 


l-H 




o 








1 


CO 




l-H 


CD 


Cm 










1 


Cm* 


1 


w 


a> 


< 






( 




1 


CO 


t 


»: 














1 


Cm 


1 












( 




1 




1 


CD 




< 










1 


<C 


CO 


CD 


Cm 






( 






| 


PC 


CO 


W 




CD 


( 


Q 




CD 


Cm" 


CD 


Q 


CD 


PC 




PC 




*< 


Cm 




-3 


Eh 




CO 










CO 


, ~) 


PC 


>H 






Cm 




o 


i 


Cm 


CO 




Cm 


l-H 


>H 




CJ 


CD 






Cm 


X 


CD 


CO 


PC 


< 


CL, 


t-H 


i 


Cm 


I 


£5 


Oi > 


Cm 




i-l 


Cm 


l 


»-3 


J 


O Cm 


> 


l-H 


Q 


CO 


CD 


Eh 


CD 


CO 


Cm 


CO 


CO 




Cm 


l-H 


CO 




CD 


CO 


o cx: 


>H 


CD 




- 


CJ 


CO 


^ 


*-3 




Cm 






CD 


Eh 


w 


w 


Cm 


Eh 


Eh 


Cm* 


CO 


CO 


Cm* 


Cm* 


6h 


CD 




< 


ac 


^3 




Eh 


Eh 


2: 


l-H 


•J 






CO 


Eh 


Q 






ac 




rc 


DC 


rc 


ac 












w 


O Oi 


Cm 






>H 




•J 


Cm 


^} 




>H 


>H 


>H 


>H 


[coj 








Cm 


c< 








Cm 










> 


l-J 


>H 


CO 








CO 


CO 


Eh 


> 


CO 


CO 


<|CD CD CD|«< 


< 


o 


CD 


CD 


o| 


CD 


O 


< 


< 


*-] 


< 


CD 


CO 


< 






>H 


>H 








>H 


>H 


>H 


-1 


Eh 


> 




> s <c 




CO 


£h 




Q 




Q 




n: 


CO 


Cm* 


E~ ' 


UC 


cd 


O 


CD 


Eh 


Eh 


CO 


<c 


CO 


CD 


<c 


>H 


ac 








cti 




ac 


Cm 


W 


















*l 










w 




aa 




Eh 




CD 






< > 55 


< 


CO 




CO 


Eh 


> 


> 


> 


hH 


l-H 


l-H 


> 




> 


l-H 


Cm 




•J 


Cm 


Cm 


Cm 


> 


Cm 


< 


3C 


>H 


Cm 


3d 


-3 


3C 


O 


>H 


>H 


>H 




Q 


Q 


Q 


Q 


Q 


Q 




Q 


CJ 


a: 


0^ 




^ 


^ 


E 


* 


« 


>C 


^ 


l Q 


Q 


a 


Q 






Q 


O 


Q 


Q 


a; 


PC 


CO 


>H 






Cm 


PC 


ce: 


PC 


<C 


CD 


CD 


CD O 


CD 




CO 




CO 


a, 

o 


8duio 


gduio 


6duio 


ompll 


omplO 


•^r lo 

e a, 

O £ 

o 


o 


gduio 



LO CM 



o o 



WO 98/58953 



PCT/DK98/00266 



C> OO OO (Ti CT\ VO 

(N r- ^x> o 



i> in Hoocooocooooooor^coinoj 



cd 


CD 


CD 


o 


CD 


< 


O 


cd 


o 


cd| 


1 t t-H 


> 


Cm 




> 


»— i 


•-3 


Cm 


>-« 




-3 


Cm 




o 




1-3 


i — < 


CM 


-3l 


o 


. . U 


,-3 


►J 






■—3 


*J 


-3 






Q 


»| 




CU 


CU 


cu 


CD 


cu 


cu 


cuj 


I I cu 


PC 




o 


CJ 


Q 


DC 


Q 


DC 


CO 


< 


-3 






•-3 




o 


.-3 


cd 




I t CO 


CO 


CO 




CD 


O 


CD 


O 


CO 


CD 


O 


< 




o 


< 






o 




< 


1 1 X 


CO 




CO 


Q 


CO 


» — ( 


CO 




£5 


Cm 


Cm 


Cm 


Cm 




o 










1 t s 


cd 


Q 




Q 


CO 


E-h 


CO 


CO 


< 


CO 


< 


< 


CO 


O 


CJ 


> 


> 


cj 




Cm 


Cm 


Cl, 


Cs-i 


Cju 


Cl, 


Cm 


Cm 


Cm 


Q 


Q 




Q 




Q 


U3 




>< 




i i < 


cu 


CO 


CO 


< 


CO 


CD 


DC 


Q 


DC 








12 


2: 








o 


L 1 




oc 


DC 


PC 




PC 


PC 


DC 


DC 


PC 


pc 


o 


cd 


O 






CO 


< 


o 


oc 


i I cd 


<c 


CD 


CD 


< 


> 


O 


> 


CO 


CD 






3: 




3: 


3= 




3= 


3: 


3: 




1 


t 




1 


•J 


i 




t— i 




co 


CO 


CO 


CO 


CO 


CO 


CO 


CO 


CO 


CO 




cu 




W 






Q 




1 




o 


cd 


CD 


O 


cd 


CO 


cd 


cd 






1 1 < 






Eh 


< 


E- 


CO 


o 


< 


< 










o cu 


cu 


o dc 




t I DC 


CD 


o 


CD 


CD 




CO 


CD 


CD 


Eh 




> 


> 


> 


< 


o 


CO 


< 


Q 


t> 


t 1 DC 


ex :*c 




CO 


DC 


CO 


G - ' 




Eh 


HH 


E-« 


cu 




cu 




cu 


CU 






. , |» 


CU 


w 




w 


CU 


Cx3 


CU 


cu 


w l 


CO 


CX, 


Dm 




J- 


cu 


CX, 


cu 




F 


t 1 >H 




CO 








Pu, 


^C 


> 




55 


>H 




>H 


>* 




>H 


-3 






CM 




PL, 


Pu 


Cm 


Cm 


Cm 


Cm 


Cm 


Q 


Er* 




CD 


CO 


O co 


E- 






I l Q 


CD 


CO 






CO 


CO 


Q 


CO 


Q 


eh 


Eh 






Eh 


Eh 






I- 


•< 


ii OW 


Q 








CU 


a 


CD 
















1 


CX, 


i 


i » [a oaooocoio' 


c 
















1 


Dm 


i 


1 I DC 


DC 




DC 


a: 


DC 


CO 


DC 


DC 


>H 
























< 


1— 1 




CO 


1— 1 


> 


< 


CO 



















1 


cx, 


1 






>H 


>H 




>< 


>H 


>H 


>H 


>H 


>H 


>H 
















t 










»— 1 


>f 




I — i 


>f 








D> 


> 




>H 


>H 


>H 


>H 


>H 


>H 


H 


«( 


E 


; 




>H 


Cm 


-3 










CJ 




•-3 


>H 








DC 


>< 


DC 




CU 










CU 


o» cu 


cu 


o* o o 


Eh 


Eh 


E-t 


Eh 




6h 


Eh 


&H 


&H 










^3 


^3 


^3 


Cm 


> 


> 


i-3 


> 


^3 


^C 




-1 


Eh 


Q l 


-[ 




5>M 


^c 


^C 




■ 








*l 


fcC 


^c 




*l 






s: 


i-3 




















> 




t— * 


^3 


^3 


i— i 




^3 


> 




DC 


Q 


Q 






Eh 


DC 


Q 


DC 


o 






Cm 


Cm 


>H 


Cm 


Cm 




2j 


Cm 


Cm 


Cm 


£5 


2: 












s l 




1 


; 




cu 


CU 


cu 


CX, 


CU 


Cm 


Dm 


Dm 


Dm 


Dm 


Q 


Q 


CO 


Q 






Q 




Q 


Eh 






CU 


53 




Eh 


Cm 


Cm 


1— » 


CO 


E- 


HH 


E-i 


Eh 


> 


E- 




CO 


CO 


DC 


&H 


< 






>l 


>H 




=1 


>• 


>H 


Cm 


>H 


^ 


<C 




DC 


DC 


-1 


>H 


-I 


DC 


DC 


DC 


DC 






cu 






CD 


< 


Ei 


>H 


CU 


CD 




Eh 


CO 


CD 


CO 


i— i 


CO 


CO 


CD 


CD 








CU 


Q 


Q 


O DC 


*C 


Cm- 


DC 


Ot 


>H 


>H 


>h 


>H 


>* 


>H 


Cm 


>H 


>H 


>H 


1; 




—3 




1 


> 




Cm 


Cm 


Cm 


Cm 


Cm 


CO 


CO 


<C 


<: 


CO 


CO 


CO 


CO 


CO 


^< 






■-3 


^3 


Cm 




Cm 


*-3 


-3 


^3 


Cm 


-3 


Cm 


l-J 


*J 


Cm 






>l 


^3 


1-3 


CJ 






>H 


< 


CJ 


CO 


Cm 


CD 


Dm 


Cm 


CD 


DC 




s; 


O DC 


o o o o Ot J 






CX, 


CO 


DC 


DC 


< 


CU 


DC 




DC 


o 


< 


CD 


o 


<C 


< 




> 




CD 


Cm 






> 


cu 


»-3 


DC 


CU 


DC 


Dm 


CO 


CD 




a 


CO 


cu 


S5 




Q 


Q 


Q 


>H 


DC 






CO 


Q 


>H 


CD 




CO 


Sm 




CO 


cu 


h 


Cm 




Cm 




Cm 






Cm 


Cm 


1' 




1 


r 


CU 


CO 


CXi 


►J 


CO 


Dm 


Eh 


Cm 


CO 


-2 


> > J <C 


Eh 


CO 


HH 






t— 1 


-3 


CU 




i-3 


< 


.-3 


>h 




> 


t 


t 




i 


i 


i 




1 


1 


S 






> 


CJ 




> 


>H 


Eh 


> 


>H 


CU 


i-3 




> 




Cm 


•-3 


> 


^3 


> 


CJ 


^3 






Cm 


t-H 


CO 


> 


-3 


DC 


Cm 




> 


i-3 


h 




cu 


Pm 


ex. 


ex. 


CU 


> 


CL, 


Cm 


1' 




Cm 


CXi 


DC 


cx, 


< 


Dm 


CU 


E-> 


< 


Dm 






^c 




I — I 


O t— l 


CM 








^3 


< 


CO 




^3 




f-3 


i 


> 






CU 


DC 


CO 


Q 


cu 


cu 




E- 


O i 




CO 


DC 


co 


< 


CO 


CO 


Q 




DC 


co 


O 6h 


CO 


Q 


NC 


CO 


PC 


O 










< 


CD 




CD 


CD 


CO 


h3 


< 


Eh 


CD 


CNI GO 

a* e 
e o 
o 


in (Tv r- io^ru-»r— 

ooggogooe 
o o o o 


a, 

e 
o 


T— ( 

e 

o 


gduio 
gduio 


& CX 

o e 
o 


omplO 


Cm r-( 

e a 
o g 
o 


o 


Dm 

E 
o 



o o 



WO "98/58953 



PCT/DK98/00266 



17/21 



t — VD VO VD OO VT) VO n CT> O (N 

04c — t — u3r-r^r-cr\CDi>iH^ 

rg CO CO CO CO CD CO CO c — oo LT» oj 



[ — ' vx? v_o oo ud rn cn cz> cm 

CT\ CTt CTa o~\ CT\ CT\ ON OO CT\ tn CsJ 



OlOOCOCDOCOCOiriHOJ^CN 
CN) OA OA 0\0>CT>0\£r»COC3MnfN 



3 3 3 3 
]co CO CO CO CO CO | E- 1 CO CO 



QOOOOSCUOOO 
co co co co co[ ;g [co co| ^ [co| 
*<i-ih>>.»-is<;cok-i 
n c: > s s ^ > i j> 



coe-<<:6-'Co<£-e-'6-'«< 






CU fa 


CU 


fa 


fa 


fa a« 


ex. 


-I 


2; 


2; Q 


o 


2; 




2; Q 


2; 


Q 


OS 


CO S3 


2 


u: 




2: 


CO 






cc; os 


OS 


a: 


as 


a: a: 


OS 


<*l 


>-< 


> »-H 




>• 






o 


fa 


< 




> 


> 


> 


> > 


> 






Q Q 




Q 




a q 


Q 


o 




< 


CO fa 


fa 


a. 






> 




> 


>> 






p 


>« >« 






>• 


fa X 








O co 




< 




fa <c 


as 


CO 










> 






PC 






CO 


CO 




co e-» 


> 


CO 



^3 ^ _3 |i»h «C| .-3 «J 
a["sE]a Q W W Q W 

>4 >M >• x fa >« >H ><">T>4 



.o < w s < >hO.O*> 
w o 2; I U U Q i< < o* 

COCJO 1 Q 2; O O fa CO 

I co co co co co co I q [To] I < 
osus:*s:*:u:osoios*5fc£: 



fa fxj 
J fa 



e^os^^os&h^^^ds 

> H > H H *-h |«<| > HH H-t 

ooooouoooo 

i — i » — t i — < ^> 



H-t|_l ^3j> t-H > > H > 



Cx3*cCCO»<COCOCO*<<CO 

cm oo lo o> t — i o •^r lo ur> 
t-h Q« CX, Cu * — i ' — * CXt-f cu ex. 

goooggogoo 
o o o o 



fa fa fa >H fa J >"l|l— 1| fa| 

^ ^ *z\pc os|:*s ^: ^: ^ ^| 



CO<:oO*JOB^OCOCO 

[oooooooooo] 
a; 



Q Q Q Q Q Q 



> > 



t-H >|<|> 1-1 > 1-H 

2; 2: zf<]2: 2: 2: 2: 2: 2: 



x 



a: os a; os 

CO CO CO CO 



OS OS OS OS OS 
CO CO CO CO 



x 

fa 

B 



cococococococococo 

col O OCDOOOOO 
[fa OS OS OS OS OS OS OS OS 

|mmmm[i]mmmm5| 

l&u, cm &4 fa|<[ZTT^o|fa"]o 
ooioou2 3:i2 ct: 
Ocoocoocoocooo 

|wwwuwwwwuwj 



S fa 

3= 2; 

Oh! CO 

2: 23 

>ffa 
a o 



fa fa 



ax 3=: 
2: 2: 



< <c 

OS OS 



co r> 



S2:a:2;2:o.<:>» 

cn<cnW2202 
fa fa _3 fa >h| hh [Z3 fa| 

T^gc j.-q >. >.| o os[>T[ 

W2tOH22UO 

oooocooco8^ 
< < < ^<|o ^ <] o 
OS osjenjos OS oslo co 



|fa -3 

o|<c 
o o 

OS OS 



>><0«-J^3>0 
JH^0H> 

^ ^ fa|Q|fa i-j|offa" 



<c < < <c <[o < < 
ocooioo ofx 

OSOSOSOSOSOSOSOS 



*<*<#<co<:cococo»<co 

> 

X 



J *J J 
2: 2: Spo| 



^ v-q ^ ^ 

2; 2: 2; 2; 2; 



1—1 T-H 

o o 



|o o| <c[o] 

>H fa X CJ 

< £-« &- Eh 
cn 00 in ON 

h a a Di 
0, e b a 
g o o o 
o 



«E-02 cn< 

o|o o o o| < 

•J 6-1 Oh X W CU 

co e-i en e-t > 

-h o> -«^r LO c — V£> 

-H r-l Q< * — I Q* Clf 

e s o e o o 
00 o 



fafafafafafafafafa fa 



OOOOCOOOSOSSxSOS 



^3* cnj 00 ld cn t-h -«^r lo vx> ro 

ea eoooggogooEc-. 

00 o 000 00 




Fig. 9 
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Immunoblotting of C. pneumoniae EB, lane 1-3 heated to 100°C in SDS-sample buffer, 
lane 4-6 unheated. Lane l reacted with rabbit anti C. pneumoniae OMC; lane 2 and 4 
pre-serum; lane 3 and 5 polyclonal rabbit anti pEXl-l fusion protein; lane 6 MAb 26. 1. 
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Fig. 12 
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